|  | .. _cfi: | 
|  |  | 
|  | ============================ | 
|  | Control-Flow Integrity (CFI) | 
|  | ============================ | 
|  |  | 
|  | This document describes the current control-flow integrity (CFI) mechanism in | 
|  | QEMU. How it can be enabled, its benefits and deficiencies, and how it affects | 
|  | new and existing code in QEMU | 
|  |  | 
|  | Basics | 
|  | ------ | 
|  |  | 
|  | CFI is a hardening technique that focusing on guaranteeing that indirect | 
|  | function calls have not been altered by an attacker. | 
|  | The type used in QEMU is a forward-edge control-flow integrity that ensures | 
|  | function calls performed through function pointers, always call a "compatible" | 
|  | function. A compatible function is a function with the same signature of the | 
|  | function pointer declared in the source code. | 
|  |  | 
|  | This type of CFI is entirely compiler-based and relies on the compiler knowing | 
|  | the signature of every function and every function pointer used in the code. | 
|  | As of now, the only compiler that provides support for CFI is Clang. | 
|  |  | 
|  | CFI is best used on production binaries, to protect against unknown attack | 
|  | vectors. | 
|  |  | 
|  | In case of a CFI violation (i.e. call to a non-compatible function) QEMU will | 
|  | terminate abruptly, to stop the possible attack. | 
|  |  | 
|  | Building with CFI | 
|  | ----------------- | 
|  |  | 
|  | NOTE: CFI requires the use of link-time optimization. Therefore, when CFI is | 
|  | selected, LTO will be automatically enabled. | 
|  |  | 
|  | To build with CFI, the minimum requirement is Clang 6+. If you | 
|  | are planning to also enable fuzzing, then Clang 11+ is needed (more on this | 
|  | later). | 
|  |  | 
|  | Given the use of LTO, a version of AR that supports LLVM IR is required. | 
|  | The easies way of doing this is by selecting the AR provided by LLVM:: | 
|  |  | 
|  | AR=llvm-ar-9 CC=clang-9 CXX=clang++-9 /path/to/configure --enable-cfi | 
|  |  | 
|  | CFI is enabled on every binary produced. | 
|  |  | 
|  | If desired, an additional flag to increase the verbosity of the output in case | 
|  | of a CFI violation is offered (``--enable-debug-cfi``). | 
|  |  | 
|  | Using QEMU built with CFI | 
|  | ------------------------- | 
|  |  | 
|  | A binary with CFI will work exactly like a standard binary. In case of a CFI | 
|  | violation, the binary will terminate with an illegal instruction signal. | 
|  |  | 
|  | Incompatible code with CFI | 
|  | -------------------------- | 
|  |  | 
|  | As mentioned above, CFI is entirely compiler-based and therefore relies on | 
|  | compile-time knowledge of the code. This means that, while generally supported | 
|  | for most code, some specific use pattern can break CFI compatibility, and | 
|  | create false-positives. The two main patterns that can cause issues are: | 
|  |  | 
|  | * Just-in-time compiled code: since such code is created at runtime, the jump | 
|  | to the buffer containing JIT code will fail. | 
|  |  | 
|  | * Libraries loaded dynamically, e.g. with dlopen/dlsym, since the library was | 
|  | not known at compile time. | 
|  |  | 
|  | Current areas of QEMU that are not entirely compatible with CFI are: | 
|  |  | 
|  | 1. TCG, since the idea of TCG is to pre-compile groups of instructions at | 
|  | runtime to speed-up interpretation, quite similarly to a JIT compiler | 
|  |  | 
|  | 2. TCI, where the interpreter has to interpret the generic *call* operation | 
|  |  | 
|  | 3. Plugins, since a plugin is implemented as an external library | 
|  |  | 
|  | 4. Modules, since they are implemented as an external library | 
|  |  | 
|  | 5. Directly calling signal handlers from the QEMU source code, since the | 
|  | signal handler may have been provided by an external library or even plugged | 
|  | at runtime. | 
|  |  | 
|  | Disabling CFI for a specific function | 
|  | ------------------------------------- | 
|  |  | 
|  | If you are working on function that is performing a call using an | 
|  | incompatible way, as described before, you can selectively disable CFI checks | 
|  | for such function by using the decorator ``QEMU_DISABLE_CFI`` at function | 
|  | definition, and add an explanation on why the function is not compatible | 
|  | with CFI. An example of the use of ``QEMU_DISABLE_CFI`` is provided here:: | 
|  |  | 
|  | /* | 
|  | * Disable CFI checks. | 
|  | * TCG creates binary blobs at runtime, with the transformed code. | 
|  | * A TB is a blob of binary code, created at runtime and called with an | 
|  | * indirect function call. Since such function did not exist at compile time, | 
|  | * the CFI runtime has no way to verify its signature and would fail. | 
|  | * TCG is not considered a security-sensitive part of QEMU so this does not | 
|  | * affect the impact of CFI in environment with high security requirements | 
|  | */ | 
|  | QEMU_DISABLE_CFI | 
|  | static inline tcg_target_ulong cpu_tb_exec(CPUState *cpu, TranslationBlock *itb) | 
|  |  | 
|  | NOTE: CFI needs to be disabled at the **caller** function, (i.e. a compatible | 
|  | cfi function that calls a non-compatible one), since the check is performed | 
|  | when the function call is performed. | 
|  |  | 
|  | CFI and fuzzing | 
|  | --------------- | 
|  |  | 
|  | There is generally no advantage of using CFI and fuzzing together, because | 
|  | they target different environments (production for CFI, debug for fuzzing). | 
|  |  | 
|  | CFI could be used in conjunction with fuzzing to identify a broader set of | 
|  | bugs that may not end immediately in a segmentation fault or triggering | 
|  | an assertion. However, other sanitizers such as address and ub sanitizers | 
|  | can identify such bugs in a more precise way than CFI. | 
|  |  | 
|  | There is, however, an interesting use case in using CFI in conjunction with | 
|  | fuzzing, that is to make sure that CFI is not triggering any false positive | 
|  | in remote-but-possible parts of the code. | 
|  |  | 
|  | CFI can be enabled with fuzzing, but with some caveats: | 
|  | 1. Fuzzing relies on the linker performing function wrapping at link-time. | 
|  | The standard BFD linker does not support function wrapping when LTO is | 
|  | also enabled. The workaround is to use LLVM's lld linker. | 
|  | 2. Fuzzing also relies on a custom linker script, which is only supported by | 
|  | lld with version 11+. | 
|  |  | 
|  | In other words, to compile with fuzzing and CFI, clang 11+ is required, and | 
|  | lld needs to be used as a linker:: | 
|  |  | 
|  | AR=llvm-ar-11 CC=clang-11 CXX=clang++-11 /path/to/configure --enable-cfi \ | 
|  | -enable-fuzzing --extra-ldflags="-fuse-ld=lld" | 
|  |  | 
|  | and then, compile the fuzzers as usual. |