Commit 9a5280b3 authored by Dave Chinner's avatar Dave Chinner Committed by Dave Chinner

xfs: reorder iunlink remove operation in xfs_ifree

The O_TMPFILE creation implementation creates a specific order of
operations for inode allocation/freeing and unlinked list
modification. Currently both are serialised by the AGI, so the order
doesn't strictly matter as long as the are both in the same
transaction.

However, if we want to move the unlinked list insertions largely out
from under the AGI lock, then we have to be concerned about the
order in which we do unlinked list modification operations.
O_TMPFILE creation tells us this order is inode allocation/free,
then unlinked list modification.

Change xfs_ifree() to use this same ordering on unlinked list
removal. This way we always guarantee that when we enter the
iunlinked list removal code from this path, we already have the AGI
locked and we don't have to worry about lock nesting AGI reads
inside unlink list locks because it's already locked and attached to
the transaction.

We can do this safely as the inode freeing and unlinked list removal
are done in the same transaction and hence are atomic operations
with respect to log recovery.
Reported-by: default avatarFrank Hofmann <fhofmann@cloudflare.com>
Fixes: 298f7bec ("xfs: pin inode backing buffer to the inode log item")
Signed-off-by: default avatarDave Chinner <dchinner@redhat.com>
Reviewed-by: default avatarDarrick J. Wong <darrick.wong@oracle.com>
Signed-off-by: default avatarDave Chinner <david@fromorbit.com>
parent d65a92de
...@@ -2594,14 +2594,13 @@ xfs_ifree_cluster( ...@@ -2594,14 +2594,13 @@ xfs_ifree_cluster(
} }
/* /*
* This is called to return an inode to the inode free list. * This is called to return an inode to the inode free list. The inode should
* The inode should already be truncated to 0 length and have * already be truncated to 0 length and have no pages associated with it. This
* no pages associated with it. This routine also assumes that * routine also assumes that the inode is already a part of the transaction.
* the inode is already a part of the transaction.
* *
* The on-disk copy of the inode will have been added to the list * The on-disk copy of the inode will have been added to the list of unlinked
* of unlinked inodes in the AGI. We need to remove the inode from * inodes in the AGI. We need to remove the inode from that list atomically with
* that list atomically with respect to freeing it here. * respect to freeing it here.
*/ */
int int
xfs_ifree( xfs_ifree(
...@@ -2623,13 +2622,16 @@ xfs_ifree( ...@@ -2623,13 +2622,16 @@ xfs_ifree(
pag = xfs_perag_get(mp, XFS_INO_TO_AGNO(mp, ip->i_ino)); pag = xfs_perag_get(mp, XFS_INO_TO_AGNO(mp, ip->i_ino));
/* /*
* Pull the on-disk inode from the AGI unlinked list. * Free the inode first so that we guarantee that the AGI lock is going
* to be taken before we remove the inode from the unlinked list. This
* makes the AGI lock -> unlinked list modification order the same as
* used in O_TMPFILE creation.
*/ */
error = xfs_iunlink_remove(tp, pag, ip); error = xfs_difree(tp, pag, ip->i_ino, &xic);
if (error) if (error)
goto out; return error;
error = xfs_difree(tp, pag, ip->i_ino, &xic); error = xfs_iunlink_remove(tp, pag, ip);
if (error) if (error)
goto out; goto out;
......
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment