lli-14 (1) Linux Manual Page
NAME
lli – manual page for lli 14
DESCRIPTION
OVERVIEW: llvm interpreter & dynamic compiler
USAGE: lli [options] <input bitcode> <program arguments>…
OPTIONS:
Color Options:
-
–color– Use colors in output (default=autodetect)
-
-O=<char> – Optimization level. [-O0,-O1,-O2, or-O3](default = ‘-O2’)–aarch64-neon-syntax=<value> – Choose style of NEON code to emit from AArch64 backend:- =generic
- – Emit generic NEON assembly
- =apple
- – Emit Apple-style NEON assembly
-
–aarch64-use-aa– Enable the use of AA during codegen.–abort-on-max-devirt-iterations-reached– Abort when the max iterations for devirtualization CGSCC repeat pass is reached–addrsig– Emit an address-significance table–align-loops=<uint> – Default alignment for loops–allow-ginsert-as-artifact– Allow G_INSERT to be considered an artifact. Hack around AMDGPU test infinite loops.–amdgpu-bypass-slow-div– Skip 64-bit divide for dynamic 32-bit values–amdgpu-disable-loop-alignment– Do not align and prefetch loops–amdgpu-disable-power-sched– Disable scheduling to minimize mAI power bursts–amdgpu-dpp-combine– Enable DPP combiner–amdgpu-dump-hsa-metadata– Dump AMDGPU HSA Metadata–amdgpu-enable-flat-scratch– Use flat scratch instructions–amdgpu-enable-merge-m0– Merge and hoist M0 initializations–amdgpu-promote-alloca-to-vector-limit=<uint> – Maximum byte size to consider promote alloca to vector–amdgpu-sdwa-peephole– Enable SDWA peepholer–amdgpu-use-aa-in-codegen– Enable the use of AA during codegen.–amdgpu-verify-hsa-metadata– Verify AMDGPU HSA Metadata–amdgpu-vgpr-index-mode– Use GPR indexing mode instead of movrel for vector indexing–arm-add-build-attributes––arm-implicit-it=<value> – Allow conditional instructions outdside of an IT block- =always
- – Accept in both ISAs, emit implicit ITs in Thumb
- =never
- – Warn in ARM, reject in Thumb
- =arm
- – Accept in ARM, reject in Thumb
- =thumb
- – Warn in ARM, emit implicit ITs in Thumb
-
–asm-show-inst– Emit internal instruction representation to assembly file–atomic-counter-update-promoted– Do counter update using atomic fetch add -
- for promoted counters only
-
–atomic-first-counter– Use atomic fetch add for first counter in a function (usually the entry counter)–basic-block-sections=<all | <function list (file)> | labels | none> – Emit basic blocks into separate sections–bounds-checking-single-trap– Use one trap block per function–cfg-hide-cold-paths=<number> – Hide blocks with relative frequency below the given value–cfg-hide-deoptimize-paths––cfg-hide-unreachable-paths––code-model=<value> – Choose code model- =tiny
- – Tiny code model
- =small
- – Small code model
- =kernel
- – Kernel code model
- =medium
- – Medium code model
- =large
- – Large code model
-
–compile-threads=<uint> – Choose the number of compile threads (jit-kind=orc-lazy only)–cost-kind=<value> – Target cost kind- =throughput
- – Reciprocal throughput
- =latency
- – Instruction latency
- =code-size
- – Code size
- =size-latency
- – Code size and latency
-
–data-sections– Emit data into separate sections–debug-entry-values– Enable debug info for the debug entry values.–debug-info-correlate– Use debug info to correlate profiles.–debugger-tune=<value> – Tune debug info for a particular debugger- =gdb
- – gdb
- =lldb
- – lldb
- =dbx
- – dbx
- =sce
- – SCE targets (e.g. PS4)
-
–debugify-level=<value> – Kind of debug info to add- =locations
- – Locations only
- =location+variables
- – Locations and Variables
-
–debugify-quiet– Suppress verbose debugify output–denormal-fp-math=<value> – Select which denormal numbers the code is permitted to require- =ieee
- – IEEE 754 denormal numbers
- =preserve-sign
- – the sign of a flushed-to-zero number is preserved in the sign of 0
- =positive-zero
- – denormals are flushed to positive zero
-
–denormal-fp-math-f32=<value> – Select which denormal numbers the code is permitted to require for float- =ieee
- – IEEE 754 denormal numbers
- =preserve-sign
- – the sign of a flushed-to-zero number is preserved in the sign of 0
- =positive-zero
- – denormals are flushed to positive zero
-
–disable-i2p-p2i-opt– Disables inttoptr/ptrtoint roundtrip optimization–disable-lazy-compilation– Disable JIT lazy compilation–disable-promote-alloca-to-lds– Disable promote alloca to LDS–disable-promote-alloca-to-vector– Disable promote alloca to vector–disable-tail-calls– Never emit tail calls–dlopen=<string> – Dynamic libraries to load before linking–do-counter-promotion– Do counter register promotion–dot-cfg-mssa=<file name for generated dot file> – file name for generated dot file–dwarf-version=<int> – Dwarf version–dwarf64– Generate debugging info in the 64-bit DWARF format–emit-call-site-info– Emit call site debug information, if debug information is enabled.–emscripten-cxx-exceptions-allowed=<string> – The list of function names in which Emscripten-style exception handling is enabled (see emscripten EMSCRIPTEN_CATCHING_ALLOWED options)–emulated-tls– Use emulated TLS model–enable-cache-manager– Use cache manager to save/load modules–enable-cse-in-irtranslator– Should enable CSE in irtranslator–enable-cse-in-legalizer– Should enable CSE in Legalizer–enable-emscripten-cxx-exceptions– WebAssembly Emscripten-style exception handling–enable-emscripten-sjlj– WebAssembly Emscripten-style setjmp/longjmp handling–enable-gvn-hoist– Enable the GVN hoisting pass (default = off)–enable-gvn-memdep––enable-gvn-sink– Enable the GVN sinking pass (default = off)–enable-load-in-loop-pre––enable-load-pre––enable-loop-simplifycfg-term-folding––enable-name-compression– Enable name/filename string compression–enable-no-infs-fp-math– Enable FP math optimizations that assume no +-Infs–enable-no-nans-fp-math– Enable FP math optimizations that assume no NaNs–enable-no-signed-zeros-fp-math– Enable FP math optimizations that assume the sign of 0 is insignificant–enable-no-trapping-fp-math– Enable setting the FP exceptions build attribute not to use exceptions–enable-split-backedge-in-load-pre––enable-unsafe-fp-math– Enable optimizations that may decrease FP precision–entry-function=<function> – Specify the entry function (default = ‘main’) of the executable–exception-model=<value> – exception model- =default
- – default exception handling model
- =dwarf
- – DWARF-like CFI based exception handling
- =sjlj
- – SjLj exception handling
- =arm
- – ARM EHABI exceptions
- =wineh
- – Windows exception model
- =wasm
- – WebAssembly exception handling
-
–experimental-debug-variable-locations– Use experimental new value-tracking variable locations–extra-archive=<input archive> – Extra archive files to be loaded–extra-module=<input bitcode> – Extra modules to be loaded–extra-object=<input object> – Extra object files to be loaded–fake-argv0=<executable> – Override the ‘argv[0]’ value passed into the executing program–fatal-warnings– Treat warnings as errors–filetype=<value> – Choose a file type (not all types are supported by all targets):- =asm
- – Emit an assembly (‘.s’) file
- =obj
- – Emit a native object (‘.o’) file
- =null
- – Emit nothing, for performance testing
-
–float-abi=<value> – Choose float ABI type- =default
- – Target default float ABI type
- =soft
- – Soft float ABI (implied by
-soft-float) - =hard
- – Hard float ABI (uses FP registers)
-
–force-dwarf-frame-section– Always emit a debug frame section.–force-interpreter– Force interpretation: disable JIT–fp-contract=<value> – Enable aggressive formation of fused FP ops- =fast
- – Fuse FP ops whenever profitable
- =on
- – Only fuse ‘blessed’ FP ops.
- =off
- – Only fuse FP ops when the result won’t be affected.
-
–frame-pointer=<value> – Specify frame pointer elimination optimization- =all
- – Disable frame pointer elimination
- =non-leaf
- – Disable frame pointer elimination for non-leaf frame
- =none
- – Enable frame pointer elimination
–fs-profile-debug-bw-threshold=<uint> – Only show debug message if the source branch weight is greater-
- than this value.
-
–fs-profile-debug-prob-diff-threshold=<uint> – Only show debug message if the branch probility is greater than this value (in percentage).–function-sections– Emit functions into separate sections–generate-merged-base-profiles– When generating nested context-sensitive profiles, always generate extra base profile for function with all its context profiles merged into it.–gpsize=<uint> – Global Pointer Addressing Size. -
- The default size is 8.
-
–hash-based-counter-split– Rename counter variable of a comdat function based on cfg hash–hot-cold-split– Enable hot-cold splitting pass–ignore-xcoff-visibility– Not emit the visibility attribute for asm in AIX OS or give all symbols ‘unspecified’ visibility in XCOFF object file–import-all-index– Import all external functions in index.–incremental-linker-compatible– When used with filetype=obj, emit an object file which can be used with an incremental linker–instcombine-code-sinking– Enable code sinking–instcombine-guard-widening-window=<uint> – How wide an instruction window to bypass looking for another guard–instcombine-max-iterations=<uint> – Limit the maximum number of instruction combining iterations–instcombine-max-num-phis=<uint> – Maximum number phis to handle in intptr/ptrint folding–instcombine-maxarray-size=<uint> – Maximum array size considered when doing a combine–instcombine-negator-enabled– Should we attempt to sink negations?–instcombine-negator-max-depth=<uint> – What is the maximal lookup depth when trying to check for viability of negation sinking.–instrprof-atomic-counter-update-all– Make all profile counter updates atomic (for testing only)–internalize-public-api-file=<filename> – A file containing list of symbol names to preserve–internalize-public-api-list=<list> – A list of symbol names to preserve–iterative-counter-promotion– Allow counter promotion across the whole loop nest.–jd=<string> – Specifies the JITDylib to be used for any subsequent-extra-modulearguments.–jit-kind=<value> – Choose underlying JIT kind.- =mcjit
- – MCJIT
- =orc
- – Orc JIT
- =orc-lazy
- – Orc-based lazy JIT.
-
–jit-linker=<value> – Choose the dynamic linker/loader.- =default
- – Default for platform and JIT-kind
- =rtdyld
- – RuntimeDyld
- =jitlink
- – Orc-specific linker
-
–load=<pluginfilename> – Load the specified plugin–lto-embed-bitcode=<value> – Embed LLVM bitcode in object files produced by LTO- =none
- – Do not embed
- =optimized
- – Embed after all optimization passes
- =post-merge-pre-opt
- – Embed post merge, but before optimizations
-
–lto-pass-remarks-filter=<regex> – Only record optimization remarks from passes whose names match the given regular expression–lto-pass-remarks-format=<format> – The format used for serializing remarks (default: YAML)–lto-pass-remarks-output=<filename> – Output filename for pass remarks–march=<string> – Architecture to generate code for (see–version)–matrix-default-layout=<value> – Sets the default matrix layout- =column-major
- – Use column-major layout
- =row-major
- – Use row-major layout
-
–mattr=<a1,+a2,-a3,…> – Target specific attributes (-mattr=,help/ for details)–max-counter-promotions=<int> – Max number of allowed counter promotions–max-counter-promotions-per-loop=<uint> – Max number counter promotions per loop to avoid increasing register pressure too much–mc-relax-all– When used with filetype=obj, relax all fixups in the emitted object file–mcabac– tbd–mcjit-remote-process=<filename> – Specify the filename of the process to launch for remote MCJIT execution. -
- If none is specified, remote execution will be simulated in-process.
-
–mcpu=<cpu-name> – Target a specific cpu type (-mcpu=,help/ for details)–meabi=<value> – Set EABI type (default depends on triple):- =default
- – Triple default EABI version
- =4
- – EABI version 4
- =5
- – EABI version 5
- =gnu
- – EABI GNU
-
–merror-missing-parenthesis– Error for missing parenthesis around predicate registers–merror-noncontigious-register– Error for register names that aren’t contigious–mhvx– Enable Hexagon Vector eXtensions–mhvx=<value> – Enable Hexagon Vector eXtensions- =v60
- – Build for HVX v60
- =v62
- – Build for HVX v62
- =v65
- – Build for HVX v65
- =v66
- – Build for HVX v66
- =v67
- – Build for HVX v67
- =v68
- – Build for HVX v68
- =v69
- – Build for HVX v69
-
–mips-compact-branches=<value> – MIPS Specific: Compact branch policy.- =never
- – Do not use compact branches if possible.
- =optimal
- – Use compact branches where appropriate (default).
- =always
- – Always use compact branches if possible.
-
–mips16-constant-islands– Enable mips16 constant islands.–mips16-hard-float– Enable mips16 hard float.–mir-strip-debugify-only– Should mir-strip-debug only strip debug info from debugified modules by default–mno-compound– Disable looking for compound instructions for Hexagon–mno-fixup– Disable fixing up resolved relocations for Hexagon–mno-ldc1-sdc1– Expand double precision loads and stores to their single precision counterparts–mno-pairing– Disable looking for duplex instructions for Hexagon–mtriple=<string> – Override target triple for module–mwarn-missing-parenthesis– Warn for missing parenthesis around predicate registers–mwarn-noncontigious-register– Warn for register names that arent contigious–mwarn-sign-mismatch– Warn for mismatching a signed and unsigned value–no-deprecated-warn– Suppress all deprecated warnings–no-discriminators– Disable generation of discriminator information.–no-process-syms– Do not resolve lli process symbols in JIT’d code–no-type-check– Suppress type errors (Wasm)–no-warn– Suppress all warnings–no-xray-index– Don’t emit xray_fn_idx section–nozero-initialized-in-bss– Don’t place zero-initialized symbols into bss section–nvptx-sched4reg– NVPTX Specific: schedule for register pressue–object-cache-dir=<string> – Directory to store cached object files (must be user writable)–opaque-pointers– Use opaque pointers–per-module-lazy– Performs lazy compilation on whole module boundaries rather than individual functions–poison-checking-function-local– Check that returns are non-poison (for testing)–print-pipeline-passes– Print a ‘-passes’ compatible string describing the pipeline (best-effort only).–r600-ir-structurize– Use StructurizeCFG IR pass–rdf-dump––rdf-limit=<uint> ––relax-elf-relocations– Emit GOTPCRELX/REX_GOTPCRELX instead of GOTPCREL on x86-64 ELF–relocation-model=<value> – Choose relocation model- =static
- – Non-relocatable code
- =pic
- – Fully relocatable, position independent code
- =dynamic-no-pic
- – Relocatable external references, non-relocatable code
- =ropi
- – Code and read-only data relocatable, accessed PC-relative
- =rwpi
- – Read-write data relocatable, accessed relative to static base
- =ropi-rwpi
- – Combination of ropi and rwpi
-
–remote-mcjit– Execute MCJIT’ed code in a separate process.–runtime-counter-relocation– Enable relocating counters at runtime.–safepoint-ir-verifier-print-only––sample-profile-check-record-coverage=<N> – Emit a warning if less than N% of records in the input profile are matched to the IR.–sample-profile-check-sample-coverage=<N> – Emit a warning if less than N% of samples in the input profile are matched to the IR.–sample-profile-max-propagate-iterations=<uint> – Maximum number of iterations to go through when propagating sample block/edge weights through the CFG.–skip-ret-exit-block– Suppress counter promotion if exit blocks contain ret.–soft-float– Generate software floating point library calls–speculative-counter-promotion-max-exiting=<uint> – The max number of exiting blocks of a loop to allow -
- speculative counter promotion
–speculative-counter-promotion-to-loop– When the option is false, if the target block is in a loop, the promotion will be disallowed unless the promoted counter- update can be further/iteratively promoted into an acyclic region.
-
–split-machine-functions– Split out cold basic blocks from machine functions based on profile information–stack-size-section– Emit a section containing stack size metadata–stack-symbol-ordering– Order local stack symbols.–stackrealign– Force align the stack to the minimum alignment–strict-dwarf– use strict dwarf–summary-file=<string> – The summary file to use for function importing.–swift-async-fp=<value> – Determine when the Swift async frame pointer should be set- =auto
- – Determine based on deployment target
- =always
- – Always set the bit
- =never
- – Never set the bit
-
–tail-predication=<value> – MVE tail-predication pass options- =disabled
- – Don’t tail-predicate loops
- =enabled-no-reductions
- – Enable tail-predication, but not for reduction loops
- =enabled
- – Enable tail-predication, including reduction loops
- =force-enabled-no-reductions
- – Enable tail-predication, but not for reduction loops, and force this which might be unsafe
- =force-enabled
- – Enable tail-predication, including reduction loops, and force this which might be unsafe
-
–tailcallopt– Turn fastcc calls into tail calls by (potentially) changing ABI.–thinlto-assume-merged– Assume the input has already undergone ThinLTO function importing and the other pre-optimization pipeline changes.–thread-entry=<string> – calls the given entry-point on a new thread (jit-kind=orc-lazy only)–thread-model=<value> – Choose threading model- =posix
- – POSIX thread model
- =single
- – Single thread model
-
–threads=<int> ––tls-size=<uint> – Bit size of immediate TLS offsets–unique-basic-block-section-names– Give unique names to every basic block section–unique-section-names– Give unique names to every section–use-ctors– Use .ctors instead of .init_array.–vec-extabi– Enable the AIX Extended Altivec ABI.–verify-region-info– Verify region info (time consuming)–vp-counters-per-site=<number> – The average number of profile counters allocated per value profiling site.–vp-static-alloc– Do static counter allocation for value profiler–wasm-enable-eh– WebAssembly exception handling–wasm-enable-sjlj– WebAssembly setjmp/longjmp handling–x86-align-branch=<string> – Specify types of branches to align (plus separated list of types): -
- jcc indicates conditional jumps fused indicates fused conditional jumps jmp indicates direct unconditional jumps call indicates direct and indirect calls ret indicates rets indirect indicates indirect unconditional jumps
-
–x86-align-branch-boundary=<uint> – Control how the assembler should align branches with NOP. If the boundary’s size is not 0, it should be a power of 2 and no less than 32. Branches will be aligned to prevent from being across or against the boundary of specified size. The default value 0 does not align branches.–x86-branches-within-32B-boundaries– Align selected instructions to mitigate negative performance impact of Intel’s micro code update for errata skx102. -
- May break assumptions about labels corresponding to particular instructions, and should be used with caution.
-
–x86-pad-max-prefix-size=<uint> – Maximum number of prefixes to use for padding–xcoff-traceback-table– Emit the XCOFF traceback table
-
–help– Display available options (–help-hiddenfor more)–help-list– Display list of available options (–help-list-hiddenfor more)–version– Display the version of this program
-
–polly– Enable the polly optimizer (with-O1,-O2or-O3)–polly-2nd-level-tiling– Enable a 2nd level loop of loop tiling–polly-ast-print-accesses– Print memory access functions–polly-context=<isl parameter set> – Provide additional constraints on the context parameters–polly-dce-precise-steps=<int> – The number of precise steps between two approximating iterations. (A value of-1schedules another approximation stage before the actual dead code elimination.–polly-delicm-max-ops=<int> – Maximum number of isl operations to invest for lifetime analysis; 0=no limit–polly-detect-full-functions– Allow the detection of full functions–polly-dump-after– Dump module after Polly transformations into a file suffixed with "-after"–polly-dump-after-file=<string> – Dump module after Polly transformations to the given file–polly-dump-before– Dump module before Polly transformations into a file suffixed with "-before"–polly-dump-before-file=<string> – Dump module before Polly transformations to the given file–polly-enable-simplify– Simplify SCoP after optimizations–polly-ignore-func=<string> – Ignore functions that match a regex. Multiple regexes can be comma separated. Scop detection will ignore all functions that match ANY of the regexes provided.–polly-isl-arg=<argument> – Option passed to ISL–polly-on-isl-error-abort– Abort if an isl error is encountered–polly-only-func=<string> – Only run on functions that match a regex. Multiple regexes can be comma separated. Scop detection will run on all functions that match ANY of the regexes provided.–polly-only-region=<identifier> – Only run on certain regions (The provided identifier must appear in the name of the region’s entry block–polly-only-scop-detection– Only run scop detection, but no other optimizations–polly-optimized-scops– Polly – Dump polyhedral description of Scops optimized with the isl scheduling optimizer and the set of post-scheduling transformations is applied on the schedule tree–polly-parallel– Generate thread parallel code (isl codegen only)–polly-parallel-force– Force generation of thread parallel code ignoring any cost model–polly-pattern-matching-based-opts– Perform optimizations based on pattern matching–polly-postopts– Apply post-rescheduling optimizations such as tiling (requires-polly-reschedule)–polly-pragma-based-opts– Apply user-directed transformation from metadata–polly-pragma-ignore-depcheck– Skip the dependency check for pragma-based transformations–polly-process-unprofitable– Process scops that are unlikely to benefit from Polly optimizations.–polly-register-tiling– Enable register tiling–polly-report– Print information about the activities of Polly–polly-reschedule– Optimize SCoPs using ISL–polly-show– Highlight the code regions that will be optimized in a (CFG BBs and LLVM-IR instructions)–polly-show-only– Highlight the code regions that will be optimized in a (CFG only BBs)–polly-stmt-granularity=<value> – Algorithm to use for splitting basic blocks into multiple statements- =bb
- – One statement per basic block
- =scalar-indep
- – Scalar independence heuristic
- =store
- – Store-level granularity
-
–polly-target=<value> – The hardware to target- =cpu
- – generate CPU code
-
–polly-tiling– Enable loop tiling–polly-vectorizer=<value> – Select the vectorization strategy- =none
- – No Vectorization
- =polly
- – Polly internal vectorizer
- =stripmine
- – Strip-mine outer loops for the loop-vectorizer to trigger
