Commits · 7d7f71cd8763a296d02dff9514447aa3de199c47 · Kirill Smelkov / linux

10 Jul, 2024 10 commits

bcachefs: Add missing bch2_trans_begin() · 7d7f71cd

Kent Overstreet authored Jul 05, 2024

this fixes a 'transaction should be locked' error in backpointers fsck
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

7d7f71cd

bcachefs: Fix missing error check in journal_entry_btree_keys_validate() · 0f6f8f76

Kent Overstreet authored Jul 04, 2024

Closes: https://syzkaller.appspot.com/bug?extid=8996d8f176cf946ef641Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

0f6f8f76

bcachefs: Warn on attempting a move with no replicas · f49d2c98

Kent Overstreet authored Jul 03, 2024

Instead of popping an assert in bch2_write(), WARN and print out some
debugging info.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f49d2c98

bcachefs: bch2_data_update_to_text() · ad8b68cd
Kent Overstreet authored Jul 03, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
ad8b68cd
bcachefs: Log mount failure error code · 0f1f7324
Kent Overstreet authored Jul 03, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
0f1f7324
bcachefs: Fix undefined behaviour in eytzinger1_first() · 8ed58789
Kent Overstreet authored Jul 03, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
8ed58789

bcachefs: Mark bch_inode_info as SLAB_ACCOUNT · 86d81ec5

Youling Tang authored Jul 03, 2024

After commit 230e9fc2 ("slab: add SLAB_ACCOUNT flag"), we need to mark
the inode cache as SLAB_ACCOUNT, similar to commit 5d097056 ("kmemcg:
account for certain kmem allocations to memcg")
Signed-off-by: Youling Tang <tangyouling@kylinos.cn>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

86d81ec5

bcachefs: Fix bch2_inode_insert() race path for tmpfiles · b02f973e
Kent Overstreet authored Jul 01, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
b02f973e

closures: fix closure_sync + closure debugging · 29f1c1ae

Kent Overstreet authored Jun 29, 2024

originally, stack closures were only used synchronously, and with the
original implementation of closure_sync() the ref never hit 0; thus,
closure_put_after_sub() assumes that if the ref hits 0 it's on the debug
list, in debug mode.

that's no longer true with the current implementation of closure_sync,
so we need a new magic so closure_debug_destroy() doesn't pop an assert.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

29f1c1ae

bcachefs: Fix journal getting stuck on a flush commit · 04357732
Kent Overstreet authored Jun 29, 2024
```
silly race
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
04357732

02 Jul, 2024 1 commit

bcachefs: io clock: run timer fns under clock lock · a2d23f3d

Kent Overstreet authored Jun 29, 2024

We don't have a way to flush a timer that's executing the callback, and
this is simple and limited enough in scope that we can just use the lock
instead.

Needed for the next patch that adds direct wakeups from the allocator to
copygc, where we're now more frequently calling io_timer_del() on an
expiring timer.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

a2d23f3d

29 Jun, 2024 6 commits

bcachefs: Repair fragmentation_lru in alloc_write_key() · b5cbb42d

Kent Overstreet authored Jun 29, 2024

fragmentation_lru derives from dirty_sectors, and wasn't being checked.
Co-developed-by: Daniel Hill <daniel@gluo.nz>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

b5cbb42d

bcachefs: add check for missing fragmentation in check_alloc_to_lru_ref() · d39881d2

Kent Overstreet authored Jun 29, 2024

We need to make sure we're not missing any fragmenation entries in the
LRU BTREE after repairing ALLOC BTREE

Also, use the new bch2_btree_write_buffer_maybe_flush() helper; this was
only working without it before since bucket invalidation (usually)
wasn't happening while fsck was running.
Co-developed-by: Daniel Hill <daniel@gluo.nz>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

d39881d2

bcachefs: bch2_btree_write_buffer_maybe_flush() · 92e1c29a

Kent Overstreet authored Jun 29, 2024

Add a new helper for checking references to write buffer btrees, where
we need a flush before we definitively know we have an inconsistency.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

92e1c29a

bcachefs: Add missing printbuf_tabstops_reset() calls · ef05bdf5

Kent Overstreet authored Jun 29, 2024

Fixes warnings from bch2_print_allocator_stuck()
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

ef05bdf5

bcachefs: Fix loop restart in bch2_btree_transactions_read() · 67c56411

Kent Overstreet authored Jun 28, 2024

Accidental infinite loop; also fix btree_deadlock_to_text()
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

67c56411

bcachefs: Fix bch2_read_retry_nodecode() · 1539bdf5

Kent Overstreet authored Jun 28, 2024

BCH_READ_NODECODE mode - used by the move paths - really wants to use
only the original rbio, but the retry path really wants to clone - oof.

Make sure to copy the crc of the pointer we read from back to the
original rbio, or we'll see spurious checksum errors later.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

1539bdf5

28 Jun, 2024 5 commits

bcachefs: Don't use the new_fs() bucket alloc path on an initialized fs · 44ec5990

Kent Overstreet authored Jun 28, 2024

On a new filesystem or device we have to allocate the journal with a
bump allocator, because allocation info isn't ready yet - but when
hot-adding a device that doesn't have a journal, we don't want to use
that path.

Reported-by: syzbot+24a867cb90d8315cccff@syzkaller.appspotmail.com
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

44ec5990

bcachefs: Fix shift greater than integer size · a0bd30e4

Kent Overstreet authored Jun 28, 2024

Reported-by: syzbot+e5292b50f1957164a4b6@syzkaller.appspotmail.com
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

a0bd30e4

bcachefs: Change bch2_fs_journal_stop() BUG_ON() to warning · 600b8be5
Kent Overstreet authored Jun 28, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
600b8be5

bcachefs: Delete old faulty bch2_trans_unlock() call · 84db6000

Kent Overstreet authored Jun 28, 2024

the unlock is now in read_extent, this fixes an assertion pop in
read_from_stale_dirty_pointer()
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

84db6000

bcachefs: Switch online_reserved shutdown assert to WARN() · 759b2e80
Kent Overstreet authored Jun 28, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
759b2e80

26 Jun, 2024 1 commit

bcachefs: Fix kmalloc bug in __snapshot_t_mut · 64cd7de9

Pei Li authored Jun 25, 2024

When allocating too huge a snapshot table, we should fail gracefully
in __snapshot_t_mut() instead of fail in kmalloc().

Reported-by: syzbot+770e99b65e26fa023ab1@syzkaller.appspotmail.com
Closes: https://syzkaller.appspot.com/bug?extid=770e99b65e26fa023ab1
Tested-by: syzbot+770e99b65e26fa023ab1@syzkaller.appspotmail.com
Signed-off-by: Pei Li <peili.dev@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

64cd7de9

25 Jun, 2024 3 commits

bcachefs: Discard, invalidate workers are now per device · 64ee1431

Kent Overstreet authored Jun 23, 2024

There's no reason for discards to be single threaded across all devices;
this will improve performance on multi device setups.

Additionally, making them per-device simplifies the refcounting on
bch_dev->io_ref; we now hold it for the duration that the discard path
is running, which fixes a race between the discard path and device
removal.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

64ee1431

bcachefs: Fix shift-out-of-bounds in bch2_blacklist_entries_gc · 472237b6

Pei Li authored Jun 25, 2024

This series fix the shift-out-of-bounds issue in
bch2_blacklist_entries_gc().

Instead of passing 0 to eytzinger0_first() when iterating the entries,
we explicitly check 0 and initialize i to be 0.

syzbot has tested the proposed patch and the reproducer did not trigger
any issue:

Reported-and-tested-by: syzbot+835d255ad6bc7f29ee12@syzkaller.appspotmail.com
Closes: https://syzkaller.appspot.com/bug?extid=835d255ad6bc7f29ee12Signed-off-by: Pei Li <peili.dev@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

472237b6

bcachefs: slab-use-after-free Read in bch2_sb_errors_from_cpu · 211c581d

Pei Li authored Jun 25, 2024

Acquire fsck_error_counts_lock before accessing the critical section
protected by this lock.

syzbot has tested the proposed patch and the reproducer did not trigger
any issue.

Reported-by: syzbot+a2bc0e838efd7663f4d9@syzkaller.appspotmail.com
Closes: https://syzkaller.appspot.com/bug?extid=a2bc0e838efd7663f4d9Signed-off-by: Pei Li <peili.dev@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

211c581d

23 Jun, 2024 8 commits

bcachefs: Add missing bch2_journal_do_writes() call · 89d21b69

Kent Overstreet authored Jun 23, 2024

This fixes a rare deadlock when we're doing an emergency shutdown due to
failure to do a journal write.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

89d21b69

bcachefs: Fix null ptr deref in journal_pins_to_text() · d6b52f68
Kent Overstreet authored Jun 23, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
d6b52f68

bcachefs: Add missing recalc_capacity() call · 36da8e38

Kent Overstreet authored Jun 23, 2024

This fixes filesystem size not changing on device removal.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

36da8e38

bcachefs: Fix btree_trans list ordering · 1aaf5cb4

Kent Overstreet authored Jun 22, 2024

The debug code relies on btree_trans_list being ordered so that it can
resume on subsequent calls or lock restarts.

However, it was using trans->locknig_wait.task.pid, which is incorrect
since btree_trans objects are cached and reused - typically by different
tasks.

Fix this by switching to pointer order, and also sort them lazily when
required - speeding up the btree_trans_get() fastpath.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

1aaf5cb4

bcachefs: Fix race between trans_put() and btree_transactions_read() · de611ab6

Kent Overstreet authored Jun 22, 2024

debug.c was using closure_get() on a different thread's closure where
the we don't know if the object being refcounted is alive.

We keep btree_trans objects on a list so they can be printed by debug
code, and because it is cost prohibitive to touch the btree_trans list
every time we allocate and free btree_trans objects, cached objects are
also on this list.

However, we do not want the debug code to see cached but not in use
btree_trans objects - critically because the btree_paths array will have
been freed (if it was reallocated).

closure_get() is also incorrect to use when that get may race with it
hitting zero, i.e. we must already have a ref on the object or know the
ref can't currently hit 0 for other reasons (as used in the cycle
detector).

to fix this, use the previously introduced closure_get_not_zero(),
closure_return_sync(), and closure_init_stack_release(); the debug code
now can only take a ref on a trans object if it's alive and in use.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

de611ab6

closures: closure_get_not_zero(), closure_return_sync() · 06efa5f3

Kent Overstreet authored Jun 22, 2024

Provide new primitives for solving a lifetime issue with bcachefs
btree_trans objects.

closure_sync_return(): like closure_sync(), wait synchronously for any
outstanding gets. like closure_return, the closure is considered
"finished" and the ref left at 0.

closure_get_not_zero(): get a ref on a closure if it's alive, i.e. the
ref is not zero.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

06efa5f3

bcachefs: Make btree_deadlock_to_text() clearer · 18e92841

Kent Overstreet authored Jun 22, 2024

btree_deadlock_to_text() searches the list of btree transactions to find
a deadlock - when it finds one it's done; it's not like other *_read()
functions that's printing each object.

Factor out btree_deadlock_to_text() to make this clearer.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

18e92841

bcachefs: fix seqmutex_relock() · f44cc269

Kent Overstreet authored Jun 22, 2024

We were grabbing the sequence number before unlock incremented it - fix
this by moving the increment to seqmutex_lock() (so the seqmutex_relock()
failure path skips the mutex_trylock()), and returning the sequence
number from unlock(), to make the API simpler and safer.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f44cc269

22 Jun, 2024 1 commit

bcachefs: Fix freeing of error pointers · 9bd01500

Kent Overstreet authored Jun 22, 2024

This fixes incorrect/missign checking of strndup_user() returns.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

9bd01500

21 Jun, 2024 5 commits

bcachefs: Move the ei_flags setting to after initialization · bd4da046

Youling Tang authored Jun 04, 2024

`inode->ei_flags` setting and cleaning should be done after initialization,
otherwise the operation is invalid.

Fixes: 9ca4853b ("bcachefs: Fix quota support for snapshots")
Signed-off-by: Youling Tang <tangyouling@kylinos.cn>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

bd4da046

bcachefs: Fix a UAF after write_super() · 2fe79ce7

Kent Overstreet authored Jun 20, 2024

write_super() may reallocate the superblock buffer - but
bch_sb_field_ext was referencing it; don't use it after the write_super
call.

Reported-by: syzbot+8992fc10a192067b8d8a@syzkaller.appspotmail.com
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

2fe79ce7

bcachefs: Use bch2_print_string_as_lines for long err · e6b3a655

Kent Overstreet authored Jun 20, 2024

printk strings get truncated to 1024 bytes; if we have a long error
message (journal debug info) we need to use a helper.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e6b3a655

bcachefs: Fix I_NEW warning in race path in bch2_inode_insert() · dd908648

Kent Overstreet authored Jun 20, 2024

discard_new_inode() is the correct interface for tearing down an indoe
that was fully created but not made visible to other threads, but it
expects I_NEW to be set, which we don't use.

Reported-by: https://github.com/koverstreet/bcachefs/issues/690
Fixes: bcachefs: Fix race path in bch2_inode_insert()
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

dd908648

bcachefs: Replace bare EEXIST with private error codes · 50479406
Kent Overstreet authored May 26, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
50479406