|  | .. | 
|  | Copyright (c) 2017 Linaro Limited | 
|  | Written by Peter Maydell | 
|  |  | 
|  | =================== | 
|  | Load and Store APIs | 
|  | =================== | 
|  |  | 
|  | QEMU internally has multiple families of functions for performing | 
|  | loads and stores. This document attempts to enumerate them all | 
|  | and indicate when to use them. It does not provide detailed | 
|  | documentation of each API -- for that you should look at the | 
|  | documentation comments in the relevant header files. | 
|  |  | 
|  |  | 
|  | ``ld*_p and st*_p`` | 
|  | ~~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These functions operate on a host pointer, and should be used | 
|  | when you already have a pointer into host memory (corresponding | 
|  | to guest ram or a local buffer). They deal with doing accesses | 
|  | with the desired endianness and with correctly handling | 
|  | potentially unaligned pointer values. | 
|  |  | 
|  | Function names follow the pattern: | 
|  |  | 
|  | load: ``ld{sign}{size}_{endian}_p(ptr)`` | 
|  |  | 
|  | store: ``st{size}_{endian}_p(ptr, val)`` | 
|  |  | 
|  | ``sign`` | 
|  | - (empty) : for 32 or 64 bit sizes | 
|  | - ``u`` : unsigned | 
|  | - ``s`` : signed | 
|  |  | 
|  | ``size`` | 
|  | - ``b`` : 8 bits | 
|  | - ``w`` : 16 bits | 
|  | - ``24`` : 24 bits | 
|  | - ``l`` : 32 bits | 
|  | - ``q`` : 64 bits | 
|  |  | 
|  | ``endian`` | 
|  | - ``he`` : host endian | 
|  | - ``be`` : big endian | 
|  | - ``le`` : little endian | 
|  |  | 
|  | The ``_{endian}`` infix is omitted for target-endian accesses. | 
|  |  | 
|  | The target endian accessors are only available to source | 
|  | files which are built per-target. | 
|  |  | 
|  | There are also functions which take the size as an argument: | 
|  |  | 
|  | load: ``ldn{endian}_p(ptr, sz)`` | 
|  |  | 
|  | which performs an unsigned load of ``sz`` bytes from ``ptr`` | 
|  | as an ``{endian}`` order value and returns it in a uint64_t. | 
|  |  | 
|  | store: ``stn{endian}_p(ptr, sz, val)`` | 
|  |  | 
|  | which stores ``val`` to ``ptr`` as an ``{endian}`` order value | 
|  | of size ``sz`` bytes. | 
|  |  | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<ld[us]\?[bwlq]\(_[hbl]e\)\?_p\>`` | 
|  | - ``\<st[bwlq]\(_[hbl]e\)\?_p\>`` | 
|  | - ``\<st24\(_[hbl]e\)\?_p\>`` | 
|  | - ``\<ldn_\([hbl]e\)\?_p\>`` | 
|  | - ``\<stn_\([hbl]e\)\?_p\>`` | 
|  |  | 
|  | ``cpu_{ld,st}*_mmu`` | 
|  | ~~~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These functions operate on a guest virtual address, plus a context | 
|  | known as a "mmu index" which controls how that virtual address is | 
|  | translated, plus a ``MemOp`` which contains alignment requirements | 
|  | among other things.  The ``MemOp`` and mmu index are combined into | 
|  | a single argument of type ``MemOpIdx``. | 
|  |  | 
|  | The meaning of the indexes are target specific, but specifying a | 
|  | particular index might be necessary if, for instance, the helper | 
|  | requires a "always as non-privileged" access rather than the | 
|  | default access for the current state of the guest CPU. | 
|  |  | 
|  | These functions may cause a guest CPU exception to be taken | 
|  | (e.g. for an alignment fault or MMU fault) which will result in | 
|  | guest CPU state being updated and control longjmp'ing out of the | 
|  | function call.  They should therefore only be used in code that is | 
|  | implementing emulation of the guest CPU. | 
|  |  | 
|  | The ``retaddr`` parameter is used to control unwinding of the | 
|  | guest CPU state in case of a guest CPU exception.  This is passed | 
|  | to ``cpu_restore_state()``.  Therefore the value should either be 0, | 
|  | to indicate that the guest CPU state is already synchronized, or | 
|  | the result of ``GETPC()`` from the top level ``HELPER(foo)`` | 
|  | function, which is a return address into the generated code [#gpc]_. | 
|  |  | 
|  | .. [#gpc] Note that ``GETPC()`` should be used with great care: calling | 
|  | it in other functions that are *not* the top level | 
|  | ``HELPER(foo)`` will cause unexpected behavior. Instead, the | 
|  | value of ``GETPC()`` should be read from the helper and passed | 
|  | if needed to the functions that the helper calls. | 
|  |  | 
|  | Function names follow the pattern: | 
|  |  | 
|  | load: ``cpu_ld{size}{end}_mmu(env, ptr, oi, retaddr)`` | 
|  |  | 
|  | store: ``cpu_st{size}{end}_mmu(env, ptr, val, oi, retaddr)`` | 
|  |  | 
|  | ``size`` | 
|  | - ``b`` : 8 bits | 
|  | - ``w`` : 16 bits | 
|  | - ``l`` : 32 bits | 
|  | - ``q`` : 64 bits | 
|  |  | 
|  | ``end`` | 
|  | - (empty) : for target endian, or 8 bit sizes | 
|  | - ``_be`` : big endian | 
|  | - ``_le`` : little endian | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<cpu_ld[bwlq]\(_[bl]e\)\?_mmu\>`` | 
|  | - ``\<cpu_st[bwlq]\(_[bl]e\)\?_mmu\>`` | 
|  |  | 
|  |  | 
|  | ``cpu_{ld,st}*_mmuidx_ra`` | 
|  | ~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These functions work like the ``cpu_{ld,st}_mmu`` functions except | 
|  | that the ``mmuidx`` parameter is not combined with a ``MemOp``, | 
|  | and therefore there is no required alignment supplied or enforced. | 
|  |  | 
|  | Function names follow the pattern: | 
|  |  | 
|  | load: ``cpu_ld{sign}{size}{end}_mmuidx_ra(env, ptr, mmuidx, retaddr)`` | 
|  |  | 
|  | store: ``cpu_st{size}{end}_mmuidx_ra(env, ptr, val, mmuidx, retaddr)`` | 
|  |  | 
|  | ``sign`` | 
|  | - (empty) : for 32 or 64 bit sizes | 
|  | - ``u`` : unsigned | 
|  | - ``s`` : signed | 
|  |  | 
|  | ``size`` | 
|  | - ``b`` : 8 bits | 
|  | - ``w`` : 16 bits | 
|  | - ``l`` : 32 bits | 
|  | - ``q`` : 64 bits | 
|  |  | 
|  | ``end`` | 
|  | - (empty) : for target endian, or 8 bit sizes | 
|  | - ``_be`` : big endian | 
|  | - ``_le`` : little endian | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<cpu_ld[us]\?[bwlq]\(_[bl]e\)\?_mmuidx_ra\>`` | 
|  | - ``\<cpu_st[bwlq]\(_[bl]e\)\?_mmuidx_ra\>`` | 
|  |  | 
|  | ``cpu_{ld,st}*_data_ra`` | 
|  | ~~~~~~~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These functions work like the ``cpu_{ld,st}_mmuidx_ra`` functions | 
|  | except that the ``mmuidx`` parameter is taken from the current mode | 
|  | of the guest CPU, as determined by ``cpu_mmu_index(env, false)``. | 
|  |  | 
|  | These are generally the preferred way to do accesses by guest | 
|  | virtual address from helper functions, unless the access should | 
|  | be performed with a context other than the default, or alignment | 
|  | should be enforced for the access. | 
|  |  | 
|  | Function names follow the pattern: | 
|  |  | 
|  | load: ``cpu_ld{sign}{size}{end}_data_ra(env, ptr, ra)`` | 
|  |  | 
|  | store: ``cpu_st{size}{end}_data_ra(env, ptr, val, ra)`` | 
|  |  | 
|  | ``sign`` | 
|  | - (empty) : for 32 or 64 bit sizes | 
|  | - ``u`` : unsigned | 
|  | - ``s`` : signed | 
|  |  | 
|  | ``size`` | 
|  | - ``b`` : 8 bits | 
|  | - ``w`` : 16 bits | 
|  | - ``l`` : 32 bits | 
|  | - ``q`` : 64 bits | 
|  |  | 
|  | ``end`` | 
|  | - (empty) : for target endian, or 8 bit sizes | 
|  | - ``_be`` : big endian | 
|  | - ``_le`` : little endian | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<cpu_ld[us]\?[bwlq]\(_[bl]e\)\?_data_ra\>`` | 
|  | - ``\<cpu_st[bwlq]\(_[bl]e\)\?_data_ra\>`` | 
|  |  | 
|  | ``cpu_{ld,st}*_data`` | 
|  | ~~~~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These functions work like the ``cpu_{ld,st}_data_ra`` functions | 
|  | except that the ``retaddr`` parameter is 0, and thus does not | 
|  | unwind guest CPU state. | 
|  |  | 
|  | This means they must only be used from helper functions where the | 
|  | translator has saved all necessary CPU state.  These functions are | 
|  | the right choice for calls made from hooks like the CPU ``do_interrupt`` | 
|  | hook or when you know for certain that the translator had to save all | 
|  | the CPU state anyway. | 
|  |  | 
|  | Function names follow the pattern: | 
|  |  | 
|  | load: ``cpu_ld{sign}{size}{end}_data(env, ptr)`` | 
|  |  | 
|  | store: ``cpu_st{size}{end}_data(env, ptr, val)`` | 
|  |  | 
|  | ``sign`` | 
|  | - (empty) : for 32 or 64 bit sizes | 
|  | - ``u`` : unsigned | 
|  | - ``s`` : signed | 
|  |  | 
|  | ``size`` | 
|  | - ``b`` : 8 bits | 
|  | - ``w`` : 16 bits | 
|  | - ``l`` : 32 bits | 
|  | - ``q`` : 64 bits | 
|  |  | 
|  | ``end`` | 
|  | - (empty) : for target endian, or 8 bit sizes | 
|  | - ``_be`` : big endian | 
|  | - ``_le`` : little endian | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<cpu_ld[us]\?[bwlq]\(_[bl]e\)\?_data\>`` | 
|  | - ``\<cpu_st[bwlq]\(_[bl]e\)\?_data\+\>`` | 
|  |  | 
|  | ``cpu_ld*_code`` | 
|  | ~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These functions perform a read for instruction execution.  The ``mmuidx`` | 
|  | parameter is taken from the current mode of the guest CPU, as determined | 
|  | by ``cpu_mmu_index(env, true)``.  The ``retaddr`` parameter is 0, and | 
|  | thus does not unwind guest CPU state, because CPU state is always | 
|  | synchronized while translating instructions.  Any guest CPU exception | 
|  | that is raised will indicate an instruction execution fault rather than | 
|  | a data read fault. | 
|  |  | 
|  | In general these functions should not be used directly during translation. | 
|  | There are wrapper functions that are to be used which also take care of | 
|  | plugins for tracing. | 
|  |  | 
|  | Function names follow the pattern: | 
|  |  | 
|  | load: ``cpu_ld{sign}{size}_code(env, ptr)`` | 
|  |  | 
|  | ``sign`` | 
|  | - (empty) : for 32 or 64 bit sizes | 
|  | - ``u`` : unsigned | 
|  | - ``s`` : signed | 
|  |  | 
|  | ``size`` | 
|  | - ``b`` : 8 bits | 
|  | - ``w`` : 16 bits | 
|  | - ``l`` : 32 bits | 
|  | - ``q`` : 64 bits | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<cpu_ld[us]\?[bwlq]_code\>`` | 
|  |  | 
|  | ``translator_ld*`` | 
|  | ~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These functions are a wrapper for ``cpu_ld*_code`` which also perform | 
|  | any actions required by any tracing plugins.  They are only to be | 
|  | called during the translator callback ``translate_insn``. | 
|  |  | 
|  | There is a set of functions ending in ``_swap`` which, if the parameter | 
|  | is true, returns the value in the endianness that is the reverse of | 
|  | the guest native endianness, as determined by ``TARGET_BIG_ENDIAN``. | 
|  |  | 
|  | Function names follow the pattern: | 
|  |  | 
|  | load: ``translator_ld{sign}{size}(env, ptr)`` | 
|  |  | 
|  | swap: ``translator_ld{sign}{size}_swap(env, ptr, swap)`` | 
|  |  | 
|  | ``sign`` | 
|  | - (empty) : for 32 or 64 bit sizes | 
|  | - ``u`` : unsigned | 
|  | - ``s`` : signed | 
|  |  | 
|  | ``size`` | 
|  | - ``b`` : 8 bits | 
|  | - ``w`` : 16 bits | 
|  | - ``l`` : 32 bits | 
|  | - ``q`` : 64 bits | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<translator_ld[us]\?[bwlq]\(_swap\)\?\>`` | 
|  |  | 
|  | ``helper_{ld,st}*_mmu`` | 
|  | ~~~~~~~~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These functions are intended primarily to be called by the code | 
|  | generated by the TCG backend.  Like the ``cpu_{ld,st}_mmu`` functions | 
|  | they perform accesses by guest virtual address, with a given ``MemOpIdx``. | 
|  |  | 
|  | They differ from ``cpu_{ld,st}_mmu`` in that they take the endianness | 
|  | of the operation only from the MemOpIdx, and loads extend the return | 
|  | value to the size of a host general register (``tcg_target_ulong``). | 
|  |  | 
|  | load: ``helper_ld{sign}{size}_mmu(env, addr, opindex, retaddr)`` | 
|  |  | 
|  | store: ``helper_{size}_mmu(env, addr, val, opindex, retaddr)`` | 
|  |  | 
|  | ``sign`` | 
|  | - (empty) : for 32 or 64 bit sizes | 
|  | - ``u`` : unsigned | 
|  | - ``s`` : signed | 
|  |  | 
|  | ``size`` | 
|  | - ``b`` : 8 bits | 
|  | - ``w`` : 16 bits | 
|  | - ``l`` : 32 bits | 
|  | - ``q`` : 64 bits | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<helper_ld[us]\?[bwlq]_mmu\>`` | 
|  | - ``\<helper_st[bwlq]_mmu\>`` | 
|  |  | 
|  | ``address_space_*`` | 
|  | ~~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These functions are the primary ones to use when emulating CPU | 
|  | or device memory accesses. They take an AddressSpace, which is the | 
|  | way QEMU defines the view of memory that a device or CPU has. | 
|  | (They generally correspond to being the "master" end of a hardware bus | 
|  | or bus fabric.) | 
|  |  | 
|  | Each CPU has an AddressSpace. Some kinds of CPU have more than | 
|  | one AddressSpace (for instance Arm guest CPUs have an AddressSpace | 
|  | for the Secure world and one for NonSecure if they implement TrustZone). | 
|  | Devices which can do DMA-type operations should generally have an | 
|  | AddressSpace. There is also a "system address space" which typically | 
|  | has all the devices and memory that all CPUs can see. (Some older | 
|  | device models use the "system address space" rather than properly | 
|  | modelling that they have an AddressSpace of their own.) | 
|  |  | 
|  | Functions are provided for doing byte-buffer reads and writes, | 
|  | and also for doing one-data-item loads and stores. | 
|  |  | 
|  | In all cases the caller provides a MemTxAttrs to specify bus | 
|  | transaction attributes, and can check whether the memory transaction | 
|  | succeeded using a MemTxResult return code. | 
|  |  | 
|  | ``address_space_read(address_space, addr, attrs, buf, len)`` | 
|  |  | 
|  | ``address_space_write(address_space, addr, attrs, buf, len)`` | 
|  |  | 
|  | ``address_space_rw(address_space, addr, attrs, buf, len, is_write)`` | 
|  |  | 
|  | ``address_space_ld{sign}{size}_{endian}(address_space, addr, attrs, txresult)`` | 
|  |  | 
|  | ``address_space_st{size}_{endian}(address_space, addr, val, attrs, txresult)`` | 
|  |  | 
|  | ``sign`` | 
|  | - (empty) : for 32 or 64 bit sizes | 
|  | - ``u`` : unsigned | 
|  |  | 
|  | (No signed load operations are provided.) | 
|  |  | 
|  | ``size`` | 
|  | - ``b`` : 8 bits | 
|  | - ``w`` : 16 bits | 
|  | - ``l`` : 32 bits | 
|  | - ``q`` : 64 bits | 
|  |  | 
|  | ``endian`` | 
|  | - ``le`` : little endian | 
|  | - ``be`` : big endian | 
|  |  | 
|  | The ``_{endian}`` suffix is omitted for byte accesses. | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<address_space_\(read\|write\|rw\)\>`` | 
|  | - ``\<address_space_ldu\?[bwql]\(_[lb]e\)\?\>`` | 
|  | - ``\<address_space_st[bwql]\(_[lb]e\)\?\>`` | 
|  |  | 
|  | ``address_space_write_rom`` | 
|  | ~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | This function performs a write by physical address like | 
|  | ``address_space_write``, except that if the write is to a ROM then | 
|  | the ROM contents will be modified, even though a write by the guest | 
|  | CPU to the ROM would be ignored. This is used for non-guest writes | 
|  | like writes from the gdb debug stub or initial loading of ROM contents. | 
|  |  | 
|  | Note that portions of the write which attempt to write data to a | 
|  | device will be silently ignored -- only real RAM and ROM will | 
|  | be written to. | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``address_space_write_rom`` | 
|  |  | 
|  | ``{ld,st}*_phys`` | 
|  | ~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These are functions which are identical to | 
|  | ``address_space_{ld,st}*``, except that they always pass | 
|  | ``MEMTXATTRS_UNSPECIFIED`` for the transaction attributes, and ignore | 
|  | whether the transaction succeeded or failed. | 
|  |  | 
|  | The fact that they ignore whether the transaction succeeded means | 
|  | they should not be used in new code, unless you know for certain | 
|  | that your code will only be used in a context where the CPU or | 
|  | device doing the access has no way to report such an error. | 
|  |  | 
|  | ``load: ld{sign}{size}_{endian}_phys`` | 
|  |  | 
|  | ``store: st{size}_{endian}_phys`` | 
|  |  | 
|  | ``sign`` | 
|  | - (empty) : for 32 or 64 bit sizes | 
|  | - ``u`` : unsigned | 
|  |  | 
|  | (No signed load operations are provided.) | 
|  |  | 
|  | ``size`` | 
|  | - ``b`` : 8 bits | 
|  | - ``w`` : 16 bits | 
|  | - ``l`` : 32 bits | 
|  | - ``q`` : 64 bits | 
|  |  | 
|  | ``endian`` | 
|  | - ``le`` : little endian | 
|  | - ``be`` : big endian | 
|  |  | 
|  | The ``_{endian}_`` infix is omitted for byte accesses. | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<ldu\?[bwlq]\(_[bl]e\)\?_phys\>`` | 
|  | - ``\<st[bwlq]\(_[bl]e\)\?_phys\>`` | 
|  |  | 
|  | ``cpu_physical_memory_*`` | 
|  | ~~~~~~~~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These are convenience functions which are identical to | 
|  | ``address_space_*`` but operate specifically on the system address space, | 
|  | always pass a ``MEMTXATTRS_UNSPECIFIED`` set of memory attributes and | 
|  | ignore whether the memory transaction succeeded or failed. | 
|  | For new code they are better avoided: | 
|  |  | 
|  | * there is likely to be behaviour you need to model correctly for a | 
|  | failed read or write operation | 
|  | * a device should usually perform operations on its own AddressSpace | 
|  | rather than using the system address space | 
|  |  | 
|  | ``cpu_physical_memory_read`` | 
|  |  | 
|  | ``cpu_physical_memory_write`` | 
|  |  | 
|  | ``cpu_physical_memory_rw`` | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<cpu_physical_memory_\(read\|write\|rw\)\>`` | 
|  |  | 
|  | ``cpu_memory_rw_debug`` | 
|  | ~~~~~~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | Access CPU memory by virtual address for debug purposes. | 
|  |  | 
|  | This function is intended for use by the GDB stub and similar code. | 
|  | It takes a virtual address, converts it to a physical address via | 
|  | an MMU lookup using the current settings of the specified CPU, | 
|  | and then performs the access (using ``address_space_rw`` for | 
|  | reads or ``cpu_physical_memory_write_rom`` for writes). | 
|  | This means that if the access is a write to a ROM then this | 
|  | function will modify the contents (whereas a normal guest CPU access | 
|  | would ignore the write attempt). | 
|  |  | 
|  | ``cpu_memory_rw_debug`` | 
|  |  | 
|  | ``dma_memory_*`` | 
|  | ~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These behave like ``address_space_*``, except that they perform a DMA | 
|  | barrier operation first. | 
|  |  | 
|  | **TODO**: We should provide guidance on when you need the DMA | 
|  | barrier operation and when it's OK to use ``address_space_*``, and | 
|  | make sure our existing code is doing things correctly. | 
|  |  | 
|  | ``dma_memory_read`` | 
|  |  | 
|  | ``dma_memory_write`` | 
|  |  | 
|  | ``dma_memory_rw`` | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<dma_memory_\(read\|write\|rw\)\>`` | 
|  | - ``\<ldu\?[bwlq]\(_[bl]e\)\?_dma\>`` | 
|  | - ``\<st[bwlq]\(_[bl]e\)\?_dma\>`` | 
|  |  | 
|  | ``pci_dma_*`` and ``{ld,st}*_pci_dma`` | 
|  | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
|  |  | 
|  | These functions are specifically for PCI device models which need to | 
|  | perform accesses where the PCI device is a bus master. You pass them a | 
|  | ``PCIDevice *`` and they will do ``dma_memory_*`` operations on the | 
|  | correct address space for that device. | 
|  |  | 
|  | ``pci_dma_read`` | 
|  |  | 
|  | ``pci_dma_write`` | 
|  |  | 
|  | ``pci_dma_rw`` | 
|  |  | 
|  | ``load: ld{sign}{size}_{endian}_pci_dma`` | 
|  |  | 
|  | ``store: st{size}_{endian}_pci_dma`` | 
|  |  | 
|  | ``sign`` | 
|  | - (empty) : for 32 or 64 bit sizes | 
|  | - ``u`` : unsigned | 
|  |  | 
|  | (No signed load operations are provided.) | 
|  |  | 
|  | ``size`` | 
|  | - ``b`` : 8 bits | 
|  | - ``w`` : 16 bits | 
|  | - ``l`` : 32 bits | 
|  | - ``q`` : 64 bits | 
|  |  | 
|  | ``endian`` | 
|  | - ``le`` : little endian | 
|  | - ``be`` : big endian | 
|  |  | 
|  | The ``_{endian}_`` infix is omitted for byte accesses. | 
|  |  | 
|  | Regexes for git grep: | 
|  | - ``\<pci_dma_\(read\|write\|rw\)\>`` | 
|  | - ``\<ldu\?[bwlq]\(_[bl]e\)\?_pci_dma\>`` | 
|  | - ``\<st[bwlq]\(_[bl]e\)\?_pci_dma\>`` |