Commits · be11ae16c4907fc9ede68cf5589d0bdd2b195d01 · Kirill Smelkov / linux

08 May, 2024 40 commits

bcachefs: __mark_pointer now takes bch_alloc_v4 · be11ae16
Kent Overstreet authored Apr 30, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
be11ae16

bcachefs: kill bch2_dev_usage_update_m() · c02eb9e8

Kent Overstreet authored Apr 30, 2024

by using bucket_m_to_alloc() more, we can get some nice code cleanup.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

c02eb9e8

bcachefs: alloc_data_type_set() · fa9bb741
Kent Overstreet authored Apr 30, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
fa9bb741
bcachefs: dirty_sectors -> replicas_sectors · 2685c67d
Kent Overstreet authored Apr 30, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
2685c67d

bcachefs: delete old gen check bch2_alloc_write_key() · d3c44cfd

Kent Overstreet authored Apr 30, 2024

this was from metadata only gc - we don't need it anymore
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

d3c44cfd

bcachefs: Correct the FS_IOC_GETFLAGS to FS_IOC32_GETFLAGS in bch2_compat_fs_ioctl() · 75a53a0a

Youling Tang authored Apr 30, 2024

It should be FS_IOC32_GETFLAGS instead of FS_IOC_GETFLAGS in
compat ioctl.
Signed-off-by: Youling Tang <tangyouling@kylinos.cn>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

75a53a0a

bcachefs: Fix error path of bch2_link_trans() · 9862022d

Youling Tang authored Apr 30, 2024

In bch2_link_trans(), if bch2_inode_nlink_inc() fails, it needs to
call bch2_trans_iter_exit() in the error path.
Signed-off-by: Youling Tang <tangyouling@kylinos.cn>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

9862022d
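
The error-path fix described above follows the usual kernel "goto cleanup" shape. The following is a minimal user-space sketch of that shape, not the bcachefs code: `demo_iter`, `demo_iter_init()`, `demo_iter_exit()`, `demo_nlink_inc()` and the `-EINVAL` return are all hypothetical stand-ins chosen for illustration.

```c
#include <errno.h>

/* Hypothetical stand-in for an iterator with an init/exit pair. */
struct demo_iter { int live; };

static int demo_live_iters;	/* iterators initialized but not yet exited */

static void demo_iter_init(struct demo_iter *it)
{
	it->live = 1;
	demo_live_iters++;
}

static void demo_iter_exit(struct demo_iter *it)
{
	if (it->live) {
		it->live = 0;
		demo_live_iters--;
	}
}

/* Hypothetical fallible step, standing in for the nlink increment. */
static int demo_nlink_inc(unsigned *nlink)
{
	if (*nlink == ~0U)
		return -EINVAL;	/* assumed error code, for illustration only */
	(*nlink)++;
	return 0;
}

static int demo_link(unsigned *nlink)
{
	struct demo_iter it;
	int ret;

	demo_iter_init(&it);

	ret = demo_nlink_inc(nlink);
	if (ret)
		goto err;	/* a bare "return ret" here would skip the exit call */

	/* ... further steps of the operation would go here ... */
err:
	demo_iter_exit(&it);	/* runs on both the success and failure paths */
	return ret;
}
```

Routing every exit through one label keeps the init/exit calls balanced even as more fallible steps are added later.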

bcachefs: Change destroy_inode to free_inode · 36aa49d3

Youling Tang authored Apr 26, 2024

The vfs[1] documentation describes free_inode as follows:
```
free_inode
    this method is called from RCU callback. If you use call_rcu()
    in ->destroy_inode to free ‘struct inode’ memory, then it’s
    better to release memory in this method.
```

free_inode will be called by the RCU callback, so it might be better
to move the inode free operation to free_inode.

Similar to commit ae6b47b5 ("fs/ntfs3: Change destroy_inode to
free_inode").

Link:
[1]: https://www.kernel.org/doc/html/latest/filesystems/vfs.html
Signed-off-by: Youling Tang <tangyouling@kylinos.cn>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

36aa49d3

bcachefs: Simplify resuming of journal position · c8bda9f2
Kent Overstreet authored Apr 26, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
c8bda9f2
bcachefs: check inode backpointer in bch2_lookup() · 83c38e3e
Kent Overstreet authored Apr 25, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
83c38e3e
bcachefs: check for inodes that should have backpointers in fsck · 4da1713a
Kent Overstreet authored Apr 25, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
4da1713a

bcachefs: bch_member.last_journal_bucket · 45150765

Kent Overstreet authored Apr 26, 2024

On recovery from clean shutdown we don't typically read the journal, but
we still want to avoid overwriting existing entries in the journal for
list_journal debugging.

Thus, add some fields to the member info section so we can remember
where we left off.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

45150765

bcachefs: uninline set_btree_iter_dontneed() · c7495413
Kent Overstreet authored Apr 25, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
c7495413

bcachefs: eliminate the uninitialized compilation warning in bch2_reconstruct_snapshots · 0af0b963

Hongbo Li authored Apr 26, 2024

When compiling the bcachefs-tools, the following compilation warning
is reported:
    libbcachefs/snapshot.c: In function ‘bch2_reconstruct_snapshots’:
    libbcachefs/snapshot.c:915:19: warning: ‘tree_id’ may be used uninitialized in this function [-Wmaybe-uninitialized]
      915 |  snapshot->v.tree = cpu_to_le32(tree_id);
    libbcachefs/snapshot.c:903:6: note: ‘tree_id’ was declared here
      903 |  u32 tree_id;
       |      ^~~~~~~

This is a false alert: @tree_id is set by bch2_snapshot_tree_create()
whenever it returns 0, and if the function returns any other value,
@tree_id is not used. So the logic itself is correct.

Although the report itself is a false alert, we can still make it more
explicit by setting the initial value of @tree_id to 0 (an invalid
tree ID).

Fixes: a292be3b ("bcachefs: Reconstruct missing snapshot nodes")
Signed-off-by: Hongbo Li <lihongbo22@huawei.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

0af0b963
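
The pattern behind this warning can be reproduced in a few lines. The sketch below is an illustration only: `demo_tree_create()` and `demo_reconstruct()` are hypothetical names, and the value 42 is arbitrary. It shows an out-parameter that is written only on success, with the caller initializing its local to 0 so the compiler no longer has to prove that the failure path never reads it.

```c
#include <stdint.h>

/* Hypothetical creator: writes *id only when it succeeds (returns 0). */
static int demo_tree_create(int should_fail, uint32_t *id)
{
	if (should_fail)
		return -1;
	*id = 42;	/* arbitrary illustrative ID */
	return 0;
}

static int demo_reconstruct(int should_fail, uint32_t *out)
{
	/* 0 doubles as "invalid ID" and quiets -Wmaybe-uninitialized. */
	uint32_t tree_id = 0;
	int ret = demo_tree_create(should_fail, &tree_id);

	if (ret)
		return ret;	/* tree_id is never read on this path */

	*out = tree_id;
	return 0;
}
```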

bcachefs: fix btree_path_clone() ip_allocated · 56522d72
Kent Overstreet authored Apr 25, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
56522d72

bcachefs: Fix format specifiers in bch2_btree_key_cache_to_text() · 8bb0eddb

Nathan Chancellor authored Apr 23, 2024

When building for a 32-bit target, for which 'size_t' is 'unsigned int',
there are two warnings around mismatched format specifiers and argument
types:

  In file included from fs/bcachefs/vstructs.h:5,
                   from fs/bcachefs/bcachefs_format.h:79,
                   from fs/bcachefs/bcachefs.h:207,
                   from fs/bcachefs/btree_key_cache.c:3:
  fs/bcachefs/btree_key_cache.c: In function 'bch2_btree_key_cache_to_text':
  fs/bcachefs/btree_key_cache.c:1046:25: error: format '%lu' expects argument of type 'long unsigned int', but argument 3 has type 'size_t' {aka 'unsigned int'} [-Werror=format=]
   1046 |         prt_printf(out, "nonpcpu freelist:\t%lu\r\n",   bc->nr_freed_nonpcpu);
        |                         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~    ~~~~~~~~~~~~~~~~~~~~
        |                                                           |
        |                                                           size_t {aka unsigned int}
  fs/bcachefs/util.h:192:63: note: in definition of macro 'prt_printf'
    192 | #define prt_printf(_out, ...)           bch2_prt_printf(_out, __VA_ARGS__)
        |                                                               ^~~~~~~~~~~
  fs/bcachefs/btree_key_cache.c:1046:47: note: format string is defined here
   1046 |         prt_printf(out, "nonpcpu freelist:\t%lu\r\n",   bc->nr_freed_nonpcpu);
        |                                             ~~^
        |                                               |
        |                                               long unsigned int
        |                                             %u
  fs/bcachefs/btree_key_cache.c:1047:25: error: format '%lu' expects argument of type 'long unsigned int', but argument 3 has type 'size_t' {aka 'unsigned int'} [-Werror=format=]
   1047 |         prt_printf(out, "pcpu freelist:\t%lu\r\n",      bc->nr_freed_pcpu);
        |                         ^~~~~~~~~~~~~~~~~~~~~~~~~       ~~~~~~~~~~~~~~~~~
        |                                                           |
        |                                                           size_t {aka unsigned int}
  fs/bcachefs/util.h:192:63: note: in definition of macro 'prt_printf'
    192 | #define prt_printf(_out, ...)           bch2_prt_printf(_out, __VA_ARGS__)
        |                                                               ^~~~~~~~~~~
  fs/bcachefs/btree_key_cache.c:1047:44: note: format string is defined here
   1047 |         prt_printf(out, "pcpu freelist:\t%lu\r\n",      bc->nr_freed_pcpu);
        |                                          ~~^
        |                                            |
        |                                            long unsigned int
        |                                          %u
  cc1: all warnings being treated as errors

Use the proper 'size_t' specifier, '%zu', to clear up the warnings for
these platforms.

Fixes: f2d47ec26af5 ("bcachefs: Btree key cache instrumentation")
Signed-off-by: Nathan Chancellor <nathan@kernel.org>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

8bb0eddb
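
As a small user-space illustration of the specifier choice (using plain `snprintf()` rather than the kernel's `prt_printf()`, and a hypothetical helper name `demo_format_count()`), `%zu` matches `size_t` regardless of whether that type is `unsigned int` or `unsigned long` on the target:

```c
#include <stddef.h>
#include <stdio.h>

/* Formats a size_t counter portably; returns snprintf()'s result. */
static int demo_format_count(char *buf, size_t len, size_t count)
{
	/*
	 * %zu is the specifier for size_t on every target. %lu is only
	 * correct where size_t happens to be unsigned long.
	 */
	return snprintf(buf, len, "nonpcpu freelist:\t%zu\n", count);
}
```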

bcachefs: Fix type of flags parameter for some ->trigger() implementations · 2d288745

Nathan Chancellor authored Apr 23, 2024

When building with clang's -Wincompatible-function-pointer-types-strict
(a warning designed to catch potential kCFI failures at build time),
there are several warnings along the lines of:

  fs/bcachefs/bkey_methods.c:118:2: error: incompatible function pointer types initializing 'int (*)(struct btree_trans *, enum btree_id, unsigned int, struct bkey_s_c, struct bkey_s, enum btree_iter_update_trigger_flags)' with an expression of type 'int (struct btree_trans *, enum btree_id, unsigned int, struct bkey_s_c, struct bkey_s, unsigned int)' [-Werror,-Wincompatible-function-pointer-types-strict]
    118 |         BCH_BKEY_TYPES()
        |         ^~~~~~~~~~~~~~~~
  fs/bcachefs/bcachefs_format.h:394:2: note: expanded from macro 'BCH_BKEY_TYPES'
    394 |         x(inode,                8)                      \
        |         ^~~~~~~~~~~~~~~~~~~~~~~~~~
  fs/bcachefs/bkey_methods.c:117:41: note: expanded from macro 'x'
    117 | #define x(name, nr) [KEY_TYPE_##name]   = bch2_bkey_ops_##name,
        |                                           ^~~~~~~~~~~~~~~~~~~~
  <scratch space>:277:1: note: expanded from here
    277 | bch2_bkey_ops_inode
        | ^~~~~~~~~~~~~~~~~~~
  fs/bcachefs/inode.h:26:13: note: expanded from macro 'bch2_bkey_ops_inode'
     26 |         .trigger        = bch2_trigger_inode,           \
      |                           ^~~~~~~~~~~~~~~~~~

There are several functions that did not have their flags parameter
converted to 'enum btree_iter_update_trigger_flags' in the recent
unification, which will cause kCFI failures at runtime because the
types, while ABI compatible (hence no warning from the non-strict
version of this warning), do not match exactly.

Fix up these functions (as well as a few other obvious functions that
should have it, even if there are no warnings currently) to resolve the
warnings and potential kCFI runtime failures.

Fixes: 31e4ef3280c8 ("bcachefs: iter/update/trigger/str_hash flag cleanup")
Signed-off-by: Nathan Chancellor <nathan@kernel.org>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

2d288745
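
To make the mismatch concrete, here is a reduced sketch with hypothetical names (`demo_trigger_flags`, `demo_trigger_fn`, `demo_ops`); it is not the bcachefs definition. The callback is declared with the enum type, so it matches the function pointer type in the ops table exactly.

```c
/* Hypothetical flag enum standing in for the trigger flags type. */
enum demo_trigger_flags {
	DEMO_TRIGGER_insert	= 1 << 0,
	DEMO_TRIGGER_overwrite	= 1 << 1,
};

/* The callback type that the ops table expects. */
typedef int (*demo_trigger_fn)(int key, enum demo_trigger_flags flags);

/*
 * Declared with the enum type, so it matches demo_trigger_fn exactly.
 * Per the commit message above, declaring 'flags' as 'unsigned int'
 * instead is ABI compatible but is not an exact type match, which is
 * what the strict warning and kCFI care about.
 */
static int demo_trigger_inode(int key, enum demo_trigger_flags flags)
{
	return (flags & DEMO_TRIGGER_insert) ? key : -key;
}

struct demo_ops {
	demo_trigger_fn trigger;
};

static const struct demo_ops demo_inode_ops = {
	.trigger = demo_trigger_inode,
};
```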

bcachefs: Kill gc_init_recurse() · 24b27975

Kent Overstreet authored Apr 06, 2024

This unifies the online and offline btree gc passes; we're not yet
running it online.

We now iterate over one level of the btree at a time - the same as
check_extents_to_backpointers(); this ordering preserves order of keys
regardless of btree splits and merges, which will be important when we
re-enable online gc.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

24b27975

bcachefs: do reflink_p repair from BTREE_TRIGGER_check_repair · c451986b
Kent Overstreet authored Apr 07, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
c451986b

bcachefs: Run bch2_check_fix_ptrs() via triggers · f40d13f9

Kent Overstreet authored Apr 07, 2024

Currently, the reflink_p gc trigger does repair as well - turning a
reflink_p key into an error key if the reflink_v it points to doesn't
exist.

This won't work with online check/repair, because the repair path once
online will be subject to transaction restarts, but BTREE_TRIGGER_gc is
not idempotent - we can't run it multiple times if we get a transaction
restart.

So we need to split these paths; to do so this patch calls
check_fix_ptrs() by a new general path - a new trigger type,
BTREE_TRIGGER_check_repair.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f40d13f9
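
Why idempotence matters under restarts can be shown with a toy model. Everything here is hypothetical (`demo_run_with_restarts()`, `demo_accumulate()`, `demo_assign()`) and only mimics the idea of a transaction body being rerun. A body that accumulates into shared state applies its effect once per attempt, while a body that assigns the final value gives the same result however many times it runs.

```c
/* Hypothetical transaction body: returns nonzero to request a restart. */
typedef int (*demo_body_fn)(int attempt, int *state);

/* Reruns the body until it stops asking for a restart. */
static int demo_run_with_restarts(demo_body_fn body, int *state)
{
	int attempt = 0;

	while (body(attempt, state))
		attempt++;
	return attempt + 1;	/* number of times the body ran */
}

/* Not idempotent: every rerun adds to the state again. */
static int demo_accumulate(int attempt, int *state)
{
	*state += 1;
	return attempt == 0;	/* force exactly one restart */
}

/* Idempotent: every rerun recomputes the same final value. */
static int demo_assign(int attempt, int *state)
{
	*state = 1;
	return attempt == 0;	/* force exactly one restart */
}
```

With one forced restart, the accumulating body leaves the state at 2 while the assigning body leaves it at 1.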

bcachefs: kill gc looping for bucket gens · 930e1a92

Kent Overstreet authored Apr 16, 2024

looping when we change a bucket gen is not ideal - it means we risk
failing if we'd go into an infinite loop, and it's better to make
forward progress even if fsck doesn't fix everything.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

930e1a92

bcachefs: bch2_bucket_ref_update() · 70e3e039

Kent Overstreet authored Apr 19, 2024

If we hit an inconsistency when updating allocation information, we
don't want to fail the update if it's for a deletion - only if it's for
a new key.

Rename check_bucket_ref() -> bucket_ref_update() so we can centralize
the logic to do this.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

70e3e039

bcachefs: Consolidate mark_stripe_bucket() and trans_mark_stripe_bucket() · 9cc455d1

Kent Overstreet authored Apr 22, 2024

This eliminates some duplicated logic, and the gc path now handles
stripe updates and deletions - we need this since soon we're bringing
back runtime gc.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

9cc455d1

bcachefs: mark_stripe_bucket cleanup · d9307646

Kent Overstreet authored Apr 20, 2024

Start to work on unifying mark_stripe_bucket() and
trans_mark_stripe_bucket(); first, clean up all the unnecessary and
gratuitous differences.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

d9307646

bcachefs: bucket_data_type_mismatch() · c4e8db2b

Kent Overstreet authored Apr 22, 2024

We're working on potentially unifying bch2_check_bucket_ref() and
bch2_check_fix_ptrs() - or at least eliminating gratuitous differences.

Most immediately, there's a bunch of cleanups to be done regarding
BCH_DATA_stripe.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

c4e8db2b

bcachefs: Clean up inode alloc · b769590f

Kent Overstreet authored Apr 20, 2024

There's no need to be using new_inode(); we can skip all that
indirection and make the code easier to follow.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

b769590f

bcachefs: journal seq blacklist gc no longer has to walk btree · f0415829

Kent Overstreet authored Apr 20, 2024

Since btree_ptr_v2, we no longer require the journal seq blacklist table
for skipping blacklisted bsets (btree node entries); the pointer to a
given node indicates how much data is present.

Therefore there's no longer any need for journal seq blacklist gc to
walk the btree - we can prune entries older than journal last_seq.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f0415829
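
Pruning by sequence number, without walking any other structure, reduces to a simple in-place filter. The sketch below assumes a hypothetical entry layout (`demo_blacklist_entry` with a half-open range `[start, end)`) and a hypothetical helper `demo_blacklist_prune()`; the real on-disk format and pruning condition may differ.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical entry: blacklists the half-open range [start, end). */
struct demo_blacklist_entry {
	uint64_t start;
	uint64_t end;
};

/*
 * Compacts the array in place, keeping only entries whose range extends
 * to last_seq or later. Returns the new entry count.
 */
static size_t demo_blacklist_prune(struct demo_blacklist_entry *e,
				   size_t nr, uint64_t last_seq)
{
	size_t dst = 0;

	for (size_t src = 0; src < nr; src++)
		if (e[src].end > last_seq)
			e[dst++] = e[src];
	return dst;
}
```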

bcachefs: plumb data_type into bch2_bucket_alloc_trans() · e7f63c67

Kent Overstreet authored Apr 20, 2024

prep work for making the allocator try to keep btree nodes within the
existing member info btree allocated bitmap
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e7f63c67

bcachefs: Add btree_allocated_bitmap to member_to_text() · 018b32a6
Kent Overstreet authored Apr 20, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
018b32a6

bcachefs: Btree key cache instrumentation · 5147b9ae

Kent Overstreet authored Apr 20, 2024

It turns out the btree key cache shrinker wasn't actually reclaiming
anything, prior to the previous patch. This adds instrumentation so that
if we have further issues we can see what's going on.

Specifically, sysfs internal/btree_key_cache is greatly expanded with
new counters, and the SRCU sequence numbers of the first 10 entries on
each pending freelist, and we also add trigger_btree_key_cache_shrink
for testing without having to prune all the system caches.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

5147b9ae

bcachefs: Remove calls to folio_set_error · e4f2c4df

Matthew Wilcox (Oracle) authored Apr 20, 2024

Common code doesn't test the error flag, so we don't need to set it in
bcachefs.  We can use folio_end_read() to combine the setting (or not)
of the uptodate flag and clearing the lock flag.

Cc: Kent Overstreet <kent.overstreet@linux.dev>
Cc: Brian Foster <bfoster@redhat.com>
Cc: linux-bcachefs@vger.kernel.org
Signed-off-by: Matthew Wilcox (Oracle) <willy@infradead.org>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e4f2c4df

bcachefs: Move gc of bucket.oldest_gen to workqueue · 10330402

Kent Overstreet authored Apr 19, 2024

This is a nice cleanup - and we've also been having problems with
kthread creation in the mount path.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

10330402

bcachefs: fix flag printing in journal_buf_to_text() · b25fd02a
Kent Overstreet authored Apr 19, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
b25fd02a

bcachefs: Sync journal when we complete a recovery pass · aef7eecb

Kent Overstreet authored Apr 17, 2024

Make things easier when we're debugging long fsck runs - persist the
work that successful recovery passes did.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

aef7eecb

bcachefs: make btree read errors silent during scan · f7643bc9
Kent Overstreet authored Apr 17, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
f7643bc9

bcachefs: Rip bch2_snapshot_equiv() out of fsck · 5a2d1521

Kent Overstreet authored Apr 16, 2024

Originally, when deleting snapshots we didn't collapse redundant
snapshot nodes; thus, the notion of a class of equivalent snapshot nodes
leaked into fsck.

Now we do, so snapshot ID equivalence classes are purely local to
snapshot deletion.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

5a2d1521

bcachefs: Check for writing btree_ptr_v2.sectors_written == 0 · 9de40d77
Kent Overstreet authored Apr 16, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
9de40d77
bcachefs: Add asserts to bch2_dev_btree_bitmap_marked_sectors() · 60f2b1bc
Kent Overstreet authored Apr 16, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
60f2b1bc
bcachefs: fs_alloc_debug_to_text() · 427e1bb8
Kent Overstreet authored Apr 16, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
427e1bb8
bcachefs: assert that online_reserved == 0 on shutdown · feb25553
Kent Overstreet authored Apr 14, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
feb25553