• David Howells's avatar
    rxrpc: Don't hold a ref for connection workqueue · 3cec055c
    David Howells authored
    Currently, rxrpc gives the connection's work item a ref on the connection
    when it queues it - and this is called from the timer expiration function.
    The problem comes when queue_work() fails (ie. the work item is already
    queued): the timer routine must put the ref - but this may cause the
    cleanup code to run.
    
    This has the unfortunate effect that the cleanup code may then be run in
    softirq context - which means that any spinlocks it might need to touch
    have to be guarded to disable softirqs (ie. they need a "_bh" suffix).
    
     (1) Don't give a ref to the work item.
    
     (2) Simplify handling of service connections by adding a separate active
         count so that the refcount isn't also used for this.
    
     (3) Connection destruction for both client and service connections can
         then be cleaned up by putting rxrpc_put_connection() out of line and
         making a tidy progression through the destruction code (offloaded to a
         workqueue if put from softirq or processor function context).  The RCU
         part of the cleanup then only deals with the freeing at the end.
    
     (4) Make rxrpc_queue_conn() return immediately if it sees the active count
         is -1 rather then queuing the connection.
    
     (5) Make sure that the cleanup routine waits for the work item to
         complete.
    
     (6) Stash the rxrpc_net pointer in the conn struct so that the rcu free
         routine can use it, even if the local endpoint has been freed.
    
    Unfortunately, neither the timer nor the work item can simply get around
    the problem by just using refcount_inc_not_zero() as the waits would still
    have to be done, and there would still be the possibility of having to put
    the ref in the expiration function.
    
    Note the connection work item is mostly going to go away with the main
    event work being transferred to the I/O thread, so the wait in (6) will
    become obsolete.
    Signed-off-by: default avatarDavid Howells <dhowells@redhat.com>
    cc: Marc Dionne <marc.dionne@auristor.com>
    cc: linux-afs@lists.infradead.org
    3cec055c
conn_event.c 11.9 KB