Commit cdea5459 authored by Rik van Riel, committed by Darrick J. Wong

xfs: fix missed wakeup on l_flush_wait

The code in xlog_wait uses the spinlock to make adding the task to
the wait queue and setting the task state to UNINTERRUPTIBLE atomic
with respect to the waker.

Doing the wakeup after releasing the spinlock opens up the following
race condition:

Task 1					Task 2
add task to wait queue
					wake up task
set task state to UNINTERRUPTIBLE

This issue was found through code inspection as a result of kworkers
being observed stuck in UNINTERRUPTIBLE state with an empty
wait queue. It is rare and largely unreproducible.

Simply moving the spin_unlock to after the wake_up_all results
in the waker not being able to see a task on the waitqueue before
it has set its state to UNINTERRUPTIBLE.
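
The two orderings can be replayed in a small userspace toy model. This is
only a sketch of the interleavings described above, not the kernel code:
every toy_* name is hypothetical, and the model assumes a single waiter
whose sleep decision depends only on its own state.

```c
#include <stdbool.h>

/* Toy model only: these names are hypothetical, not kernel APIs. */
enum toy_state { TOY_RUNNING, TOY_UNINTERRUPTIBLE };

struct toy_task {
	enum toy_state state;
	bool on_queue;
};

/* Waiter step 1: become visible to wakers. */
static void toy_add_to_queue(struct toy_task *t)
{
	t->on_queue = true;
}

/* Waiter step 2: mark the task as about to sleep. */
static void toy_set_uninterruptible(struct toy_task *t)
{
	t->state = TOY_UNINTERRUPTIBLE;
}

/* Waker: make a queued task runnable; an unqueued task is untouched. */
static void toy_wake_up_all(struct toy_task *t)
{
	if (t->on_queue)
		t->state = TOY_RUNNING;
}

/* Waiter step 3: the task goes to sleep only if it is not runnable. */
static bool toy_schedule_sleeps(const struct toy_task *t)
{
	return t->state == TOY_UNINTERRUPTIBLE;
}

/* Waker without the lock: the wakeup lands between steps 1 and 2. */
static bool toy_unlocked_waker_loses_wakeup(void)
{
	struct toy_task t = { TOY_RUNNING, false };

	toy_add_to_queue(&t);
	toy_wake_up_all(&t);		/* wakeup is spent too early */
	toy_set_uninterruptible(&t);
	return toy_schedule_sleeps(&t);	/* task sleeps, wakeup already gone */
}

/* Waker holding the lock: it runs only after steps 1 and 2 are both done. */
static bool toy_locked_waker_loses_wakeup(void)
{
	struct toy_task t = { TOY_RUNNING, false };

	toy_add_to_queue(&t);
	toy_set_uninterruptible(&t);
	toy_wake_up_all(&t);		/* wakeup sees the final state */
	return toy_schedule_sleeps(&t);	/* task stays runnable */
}
```

In this model the unlocked ordering leaves the task asleep with no wakeup
pending, while the locked ordering leaves it runnable.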

This bug dates back to the conversion of this code to generic
waitqueue infrastructure from a counting semaphore back in 2008
which didn't place the wakeups consistently w.r.t. the relevant
spin locks.

[dchinner: Also fix a similar issue in the shutdown path on
xc_commit_wait. Update commit log with more details of the issue.]

Fixes: d748c623 ("[XFS] Convert l_flushsema to a sv_t")
Reported-by: Chris Mason <clm@fb.com>
Signed-off-by: Rik van Riel <riel@surriel.com>
Signed-off-by: Dave Chinner <dchinner@redhat.com>
Reviewed-by: Darrick J. Wong <darrick.wong@oracle.com>
Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
parent 7c107afb
@@ -2646,7 +2646,6 @@ xlog_state_do_callback(
 	int		funcdidcallbacks; /* flag: function did callbacks */
 	int		repeats;	/* for issuing console warnings if
 				 * looping too many times */
-	int		wake = 0;
 	spin_lock(&log->l_icloglock);
 	first_iclog = iclog = log->l_iclog;
@@ -2842,11 +2841,9 @@ xlog_state_do_callback(
 #endif
 	if (log->l_iclog->ic_state & (XLOG_STATE_ACTIVE|XLOG_STATE_IOERROR))
-		wake = 1;
-	spin_unlock(&log->l_icloglock);
-	if (wake)
 		wake_up_all(&log->l_flush_wait);
+	spin_unlock(&log->l_icloglock);
 }
@@ -3946,7 +3943,9 @@ xfs_log_force_umount(
 	 * item committed callback functions will do this again under lock to
 	 * avoid races.
 	 */
+	spin_lock(&log->l_cilp->xc_push_lock);
 	wake_up_all(&log->l_cilp->xc_commit_wait);
+	spin_unlock(&log->l_cilp->xc_push_lock);
 	xlog_state_do_callback(log, true, NULL);
 #ifdef XFSERRORDEBUG