Commits · 66487c54ad8207ef1c029df45ffaeaafbcc11f77 · Kirill Smelkov / linux

22 Oct, 2023 40 commits

bcachefs: Fix is_ancestor bitmap · 66487c54

Kent Overstreet authored Jul 13, 2023

The is_ancestor bitmap is at optimization for bch2_snapshot_is_ancestor;
once we get sufficiently close to the ancestor ID we're searching for we
test a bitmap.

But initialization of the is_ancestor bitmap was broken; we do it by
using bch2_snapshot_parent(), but we call that on nodes that haven't
been initialized yet with bch2_mark_snapshot().

Fix this by adding a separate loop in bch2_snapshots_read() for
initializing the is_ancestor bitmap, and also add some new debug asserts
for checking this sort of breakage in the future.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

66487c54

bcachefs: move check_pos_snapshot_overwritten() to snapshot.c · fa5bed37
Kent Overstreet authored Aug 18, 2023
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
fa5bed37

bcachefs: Fix bch2_mount error path · 7573041a

Kent Overstreet authored Aug 18, 2023

In the bch2_mount() error path, we were calling
deactivate_locked_super(), which calls ->kill_sb(), which in our case
was calling bch2_fs_free() without __bch2_fs_stop().

This changes bch2_mount() to just call bch2_fs_stop() directly.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

7573041a

bcachefs: Delete a faulty assertion · adc0e950
Kent Overstreet authored Aug 18, 2023
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
adc0e950

bcachefs: Improve btree_path_relock_fail tracepoint · 55d5276d

Kent Overstreet authored Aug 17, 2023

In https://github.com/koverstreet/bcachefs/issues/450, we're seeing
unexplained btree_path_relock_fail events - according to the information
currently in the tracepoint, it appears the relock should be succeeding.

This adds lock counts to the tracepoint to help track it down.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

55d5276d

bcachefs: Fix divide by zero in rebalance_work() · d0445e13

Kent Overstreet authored Aug 17, 2023

This fixes https://github.com/koverstreet/bcachefs-tools/issues/159Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

d0445e13

bcachefs: Split out snapshot.c · 8e877caa

Kent Overstreet authored Aug 16, 2023

subvolume.c has gotten a bit large, this splits out a separate file just
for managing snapshot trees - BTREE_ID_snapshots.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

8e877caa

bcachefs: stack_trace_save_tsk() depends on CONFIG_STACKTRACE · e5570df2
Kent Overstreet authored Aug 16, 2023
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
e5570df2

bcachefs: Fix swallowing of data in buffered write path · 62898dd1

Kent Overstreet authored Aug 14, 2023

In __bch2_buffered_write, if we fail to write to an entire !uptodate
folio, we have to back out the write, bail out and retry.

But we were missing an iov_iter_revert() call, so the data written to
the folio was lost and the rest of the write shifted to the wrong
offset.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

62898dd1

bcachefs: fix up wonky error handling in bch2_seek_pagecache_hole() · 8c9b0f7b

Brian Foster authored Aug 14, 2023

The folio_hole_offset() helper returns a mix of bool and int types.
The latter is to support a possible -EAGAIN error code when using
nonblocking locks. This is not only confusing, but the only caller
also essentially ignores errors outside of stopping the range
iteration. This means an -EAGAIN error can't return directly from
folio_hole_offset() and may be lost via bch2_clamp_data_hole().

Fix up the error handling and make it more readable.
__filemap_get_folio() returns -ENOENT instead of NULL when no folio
exists, so reuse the same error code in folio_hole_offset(). Fix up
bch2_seek_pagecache_hole() to return the current offset on -ENOENT,
but otherwise return unexpected error code up to the caller.
Signed-off-by: Brian Foster <bfoster@redhat.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

8c9b0f7b

bcachefs: Fix bkey format calculation · 029b85fe

Kent Overstreet authored Aug 13, 2023

For extents, we increase the number of bits of the size field to allow
extents to get bigger due to merging - but this code didn't check for
overflow.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

029b85fe

bcachefs: Fix bch2_extent_fallocate() · c8ef8c3e

Kent Overstreet authored Aug 13, 2023

 - There was no need for a retry loop in bch2_extent_fallocate(); if we
   have to retry we may be overwriting something different and we need
   to return an error and let the caller retry.
 - The bch2_alloc_sectors_start() error path was wrong, and wasn't
   running our cleanup at the end of the function

This also fixes a very rare open bucket leak due to the missing cleanup.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

c8ef8c3e

bcachefs: Zero btree_paths on allocation · ff5b741c

Kent Overstreet authored Aug 13, 2023

This fixes a bug in the cycle detector, bch2_check_for_deadlock() - we
have to make sure the node pointers in the btree paths array are set to
something not-garbage before another thread may see them.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

ff5b741c

bcachefs: Fix 'pointer to invalid device' check · e9679b4a

Kent Overstreet authored Aug 13, 2023

This fixes the device removal tests, which have been failing at random
due to the fact that when we're running the .key_invalid checks in the
write path the key may actually no longer exist - we might be racing
with the keys being deleted.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e9679b4a

bcachefs: Lower BCH_NAME_MAX to 512 · a125c074

Joshua Ashton authored Aug 13, 2023

To ensure we aren't shooting ourselves in the foot after merge for
potentially doing future revisions for dirent or for storing multiple
names for casefolding, limit this to 512 for now.

Previously this define was linked to the max size a d_name in
bch_dirent could be.
Signed-off-by: Joshua Ashton <joshua@froggi.es>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

a125c074

bcachefs: Optimize bch2_dirent_name_bytes · 29c336af

Joshua Ashton authored Aug 12, 2023

Avoids doing a full strnlen for getting the length of the name of a
dirent entry.

Given the fact that the name of dirents is stored at the end of the
bkey's value, and we know the length of that in u64s, we can find the
last u64 and figure out how many NUL bytes are at the end of the string.

On little endian systems this ends up being the leading zeros of the
last u64, whereas on big endian systems this ends up being the trailing
zeros of the last u64.
We can take that value in bits and divide it by 8 to get the number of
NUL bytes at the end.

There is no endian-fixup or other compatibility here as this is string
data interpreted as a u64.
Signed-off-by: Joshua Ashton <joshua@froggi.es>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

29c336af

bcachefs: Introduce bch2_dirent_get_name · 01a7e74f

Joshua Ashton authored Aug 12, 2023

A nice cleanup that avoids a bunch of open-coding name/string usage
around dirent usage.

Will be used by casefolding impl in future commits.
Signed-off-by: Joshua Ashton <joshua@froggi.es>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

01a7e74f

bcachefs: six locks: Guard against wakee exiting in __six_lock_wakeup() · f854ce4d
Kent Overstreet authored Aug 12, 2023
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
f854ce4d
bcachefs: Don't open code closure_nr_remaining() · 93ee2c4b
Kent Overstreet authored Aug 12, 2023
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
93ee2c4b

bcachefs: Fix lifetime in bch2_write_done(), add assertion · df5a915a

Kent Overstreet authored Aug 12, 2023

We're hunting for an open_bucket leak, add an assertion to help track it
down: also, we can't use the bch_fs after dropping our write ref to it.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

df5a915a

bcachefs: Add a comment for should_drop_open_bucket() · 73ded163
Kent Overstreet authored Aug 12, 2023
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
73ded163

bcachefs: six locks: Fix missing barrier on wait->lock_acquired · c294ea50

Kent Overstreet authored Aug 12, 2023

Six locks do lock handoff via the wakeup path: the thread doing the
wakeup also takes the lock on behalf of the waiter, which means the
waiter only has to look at its waitlist entry, and doesn't have to touch
the lock cacheline while another thread is using it.

Linus noticed that this needs a real barrier, which this patch fixes.

Also add a comment for the should_sleep_fn() error path.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Boqun Feng <boqun.feng@gmail.com>
Cc: linux-bcachefs@vger.kernel.org
Cc: linux-kernel@vger.kernel.org

c294ea50

bcachefs: Check for directories in deleted inodes btree · c0363591
Kent Overstreet authored Aug 12, 2023
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
c0363591

bcachefs: Add btree_trans* to inode_set_fn · 791236b8

Joshua Ashton authored Aug 12, 2023

This will be used when we need to re-hash a directory tree when setting
flags.

It is not possible to have concurrent btree_trans on a thread.
Signed-off-by: Joshua Ashton <joshua@froggi.es>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

791236b8

bcachefs: Improve bch2_write_points_to_text() · e6375481

Kent Overstreet authored Aug 12, 2023

Now we also print the open_buckets owned by each write_point - this is
to help with debugging a shutdown hang.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e6375481

bcachefs: Fix check_version_upgrade() · e0a2b00a

Kent Overstreet authored Aug 11, 2023

We were failing to upgrade to the latest compatible version - whoops.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e0a2b00a

bcachefs: Fix 'journal not marked as containing replicas' · 83b3d959

Kent Overstreet authored Aug 11, 2023

This fixes the replicas_write_errors test: the patch
  bcachefs: mark journal replicas before journal write submission

partially fixed replicas marking for the journal, but it broke the case
where one replica failed - this patch re-adds marking after the journal
write completes, when we know how many replicas succeeded.

Additionally, we do not consider it a fsck error when the very last
journal entry is not correctly marked, since there is an inherent race
there.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

83b3d959

bcachefs: btree_journal_iter.c · 401585fe

Kent Overstreet authored Aug 05, 2023

Split out a new file from recovery.c for managing the list of keys we
read from the journal: before journal replay finishes the btree iterator
code needs to be able to iterate over and return keys from the journal
as well, so there's a fair bit of code here.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

401585fe

bcachefs: sb-clean.c · a37ad1a3

Kent Overstreet authored Aug 05, 2023

Pull code for bch_sb_field_clean out into its own file.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

a37ad1a3

bcachefs: Move bch_sb_field_crypt code to checksum.c · 0ec39856
Kent Overstreet authored Aug 05, 2023
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
0ec39856

bcachefs: sb-members.c · fb8e5b4c

Kent Overstreet authored Aug 05, 2023

Split out a new file for bch_sb_field_members - we'll likely want to
move more code here in the future.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

fb8e5b4c

bcachefs: Split up btree_update_leaf.c · 8079aab0

Kent Overstreet authored Aug 05, 2023

We now have
  btree_trans_commit.c
  btree_update.c
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

8079aab0

bcachefs: Split up fs-io.[ch] · dbbfca9f

Kent Overstreet authored Aug 03, 2023

fs-io.c is too big - time for some reorganization
 - fs-dio.c: direct io
 - fs-pagecache.c: pagecache data structures (bch_folio), utility code
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

dbbfca9f

bcachefs: Fix assorted checkpatch nits · 1e81f89b
Kent Overstreet authored Aug 07, 2023
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
1e81f89b

bcachefs: Fix for sb buffer being misaligned · 6fe893ea

Kent Overstreet authored Aug 07, 2023

On old kernels, kmalloc() may return an allocation that's not naturally
aligned - this resulted in a bug where we allocated a bio with not
enough biovecs. Fix this by using buf_pages().
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

6fe893ea

bcachefs: Convert journal validation to bkey_invalid_flags · c4e382e2

Kent Overstreet authored Aug 06, 2023

This fixes a bug where we were already passing bkey_invalid_flags
around, but treating the parameter as just read/write - so the compat
code wasn't being run correctly.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

c4e382e2

bcachefs: Improve journal_entry_err_msg() · a8712967
Kent Overstreet authored Aug 06, 2023
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
a8712967

bcachefs: BCH_COMPAT_bformat_overflow_done no longer required · e08e63e4

Kent Overstreet authored Aug 06, 2023

Awhile back, we changed bkey_format generation to ensure that the packed
representation could never represent fields larger than the unpacked
representation.

This was to ensure that bkey_packed_successor() always gave a sensible
result, but in the current code bkey_packed_successor() is only used in
a debug assertion - not for anything important.

This kills the requirement that we've gotten rid of those weird bkey
formats, and instead changes the assertion to check if we're dealing
with an old weird bkey format.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e08e63e4

bcachefs: kill EBUG_ON() redefinition in bkey.c · 58705da6

Kent Overstreet authored Aug 06, 2023

our debug mode assertions in bkey.c haven't been getting run, whoops
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

58705da6

bcachefs: Add logging to bch2_inode_peek() & related · e691b391

Kent Overstreet authored Aug 06, 2023

Add error messages when we fail to lookup an inode, and also add a few
missing bch2_err_class() calls.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e691b391