• Joan Bruguera Micó's avatar
    mm: shrinkers: fix race condition on debugfs cleanup · 26e239b3
    Joan Bruguera Micó authored
    When something registers and unregisters many shrinkers, such as:
        for x in $(seq 10000); do unshare -Ui true; done
    
    Sometimes the following error is printed to the kernel log:
        debugfs: Directory '...' with parent 'shrinker' already present!
    
    This occurs since commit badc28d4 ("mm: shrinkers: fix deadlock in
    shrinker debugfs") / v6.2: Since the call to `debugfs_remove_recursive`
    was moved outside the `shrinker_rwsem`/`shrinker_mutex` lock, but the call
    to `ida_free` stayed inside, a newly registered shrinker can be
    re-assigned that ID and attempt to create the debugfs directory before the
    directory from the previous shrinker has been removed.
    
    The locking changes in commit f95bdb70 ("mm: vmscan: make global slab
    shrink lockless") made the race condition more likely, though it existed
    before then.
    
    Commit badc28d4 ("mm: shrinkers: fix deadlock in shrinker debugfs")
    could be reverted since the issue is addressed should no longer occur
    since the count and scan operations are lockless since commit 20cd1892
    ("mm: shrinkers: make count and scan in shrinker debugfs lockless"). 
    However, since this is a contended lock, prefer instead moving `ida_free`
    outside the lock to avoid the race.
    
    Link: https://lkml.kernel.org/r/20230503013232.299211-1-joanbrugueram@gmail.com
    Fixes: badc28d4 ("mm: shrinkers: fix deadlock in shrinker debugfs")
    Signed-off-by: default avatarJoan Bruguera Micó <joanbrugueram@gmail.com>
    Cc: Qi Zheng <zhengqi.arch@bytedance.com>
    Cc: Roman Gushchin <roman.gushchin@linux.dev>
    Signed-off-by: default avatarAndrew Morton <akpm@linux-foundation.org>
    26e239b3
vmscan.c 222 KB