• Jens Axboe's avatar
    io_uring: add support for registering ring file descriptors · e7a6c00d
    Jens Axboe authored
    Lots of workloads use multiple threads, in which case the file table is
    shared between them. This makes getting and putting the ring file
    descriptor for each io_uring_enter(2) system call more expensive, as it
    involves an atomic get and put for each call.
    
    Similarly to how we allow registering normal file descriptors to avoid
    this overhead, add support for an io_uring_register(2) API that allows
    to register the ring fds themselves:
    
    1) IORING_REGISTER_RING_FDS - takes an array of io_uring_rsrc_update
       structs, and registers them with the task.
    2) IORING_UNREGISTER_RING_FDS - takes an array of io_uring_src_update
       structs, and unregisters them.
    
    When a ring fd is registered, it is internally represented by an offset.
    This offset is returned to the application, and the application then
    uses this offset and sets IORING_ENTER_REGISTERED_RING for the
    io_uring_enter(2) system call. This works just like using a registered
    file descriptor, rather than a real one, in an SQE, where
    IOSQE_FIXED_FILE gets set to tell io_uring that we're using an internal
    offset/descriptor rather than a real file descriptor.
    
    In initial testing, this provides a nice bump in performance for
    threaded applications in real world cases where the batch count (eg
    number of requests submitted per io_uring_enter(2) invocation) is low.
    In a microbenchmark, submitting NOP requests, we see the following
    increases in performance:
    
    Requests per syscall	Baseline	Registered	Increase
    ----------------------------------------------------------------
    1			 ~7030K		 ~8080K		+15%
    2			~13120K		~14800K		+13%
    4			~22740K		~25300K		+11%
    Co-developed-by: default avatarXiaoguang Wang <xiaoguang.wang@linux.alibaba.com>
    Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
    e7a6c00d
io_uring.c 282 KB