• Alexey Kardashevskiy's avatar
    vfio-pci: Allow mapping MSIX BAR · a32295c6
    Alexey Kardashevskiy authored
    By default VFIO disables mapping of MSIX BAR to the userspace as
    the userspace may program it in a way allowing spurious interrupts;
    instead the userspace uses the VFIO_DEVICE_SET_IRQS ioctl.
    In order to eliminate guessing from the userspace about what is
    mmapable, VFIO also advertises a sparse list of regions allowed to mmap.
    
    This works fine as long as the system page size equals to the MSIX
    alignment requirement which is 4KB. However with a bigger page size
    the existing code prohibits mapping non-MSIX parts of a page with MSIX
    structures so these parts have to be emulated via slow reads/writes on
    a VFIO device fd. If these emulated bits are accessed often, this has
    serious impact on performance.
    
    This allows mmap of the entire BAR containing MSIX vector table.
    
    This removes the sparse capability for PCI devices as it becomes useless.
    
    As the userspace needs to know for sure whether mmapping of the MSIX
    vector containing data can succeed, this adds a new capability -
    VFIO_REGION_INFO_CAP_MSIX_MAPPABLE - which explicitly tells the userspace
    that the entire BAR can be mmapped.
    
    This does not touch the MSIX mangling in the BAR read/write handlers as
    we are doing this just to enable direct access to non MSIX registers.
    Signed-off-by: default avatarAlexey Kardashevskiy <aik@ozlabs.ru>
    [aw - fixup whitespace, trim function name]
    Signed-off-by: default avatarAlex Williamson <alex.williamson@redhat.com>
    a32295c6
vfio_pci.c 35.4 KB