Commits · 99c87fe0f584f8d778a323141504d1ba5c89a4a5 · Kirill Smelkov / linux

16 Aug, 2024 4 commits

bcachefs: fix incorrect i_state usage · 99c87fe0

Kent Overstreet authored Aug 16, 2024

Reported-by: syzbot+95e40eae71609e40d851@syzkaller.appspotmail.com
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

99c87fe0

bcachefs: avoid overflowing LRU_TIME_BITS for cached data lru · 9482f3b0

Kent Overstreet authored Aug 16, 2024

Reported-by: syzbot+510b0b28f8e6de64d307@syzkaller.appspotmail.com
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

9482f3b0

bcachefs: Fix forgetting to pass trans to fsck_err() · 075cabf3

Kent Overstreet authored Aug 16, 2024

Reported-by: syzbot+e3938cd6d761b78750e6@syzkaller.appspotmail.com
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

075cabf3

bcachefs: Increase size of cuckoo hash table on too many rehashes · c2f6e16a

Kent Overstreet authored Aug 15, 2024

Also, improve the calculation of the new table size, so that it can
shrink when needed.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

c2f6e16a

14 Aug, 2024 14 commits

bcachefs: bcachefs_metadata_version_disk_accounting_inum · 58474f76

Kent Overstreet authored Aug 12, 2024

This adds another disk accounting counter to track usage per inode
number (any snapshot ID).

This will be used for a couple things:

- It'll give us a way to tell the user how much space a given file ista
  consuming in all snapshots; i.e. how much extra space it's consuming
  due to snapshot versioning.

- It counts number of extents and total size of extents (both in btree
  keyspace sectors and actual disk usage), meaning it gives us average
  extent size: that is, it'll let us cheaply find fragmented files that
  should be defragmented.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

58474f76

bcachefs: Kill __bch2_accounting_mem_mod() · 5132b99b

Kent Overstreet authored Aug 12, 2024

The next patch will be adding a disk accounting counter type which is
not kept in the in-memory eytzinger tree.

As prep, fold __bch2_accounting_mem_mod() into
bch2_accounting_mem_mod_locked() so that we can check for that counter
type and bail out without calling bpos_to_disk_accounting_pos() twice.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

5132b99b

bcachefs: Make bkey_fsck_err() a wrapper around fsck_err() · d97de0d0

Kent Overstreet authored Aug 12, 2024

bkey_fsck_err() was added as an interface that looks like fsck_err(),
but previously all it did was ensure that the appropriate error counter
was incremented in the superblock.

This is a cleanup and bugfix patch that converts it to a wrapper around
fsck_err(). This is needed to fix an issue with the upgrade path to
disk_accounting_v3, where the "silent fix" error list now includes
bkey_fsck errors; fsck_err() handles this in a unified way, and since we
need to change printing of bkey fsck errors from the caller to the inner
bkey_fsck_err() calls, this ends up being a pretty big change.

Als,, rename .invalid() methods to .validate(), for clarity, while we're
changing the function signature anyways (to drop the printbuf argument).
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

d97de0d0

bcachefs: Fix warning in __bch2_fsck_err() for trans not passed in · c9947102
Kent Overstreet authored Aug 12, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
c9947102
bcachefs: Add a time_stat for blocked on key cache flush · 06a8693b
Kent Overstreet authored Aug 10, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
06a8693b

bcachefs: Improve trans_blocked_journal_reclaim tracepoint · 790666c8

Kent Overstreet authored Aug 10, 2024

include information about the state of the btree key cache
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

790666c8

bcachefs: Add hysteresis to waiting on btree key cache flush · 7254555c

Kent Overstreet authored Aug 10, 2024

This helps ensure key cache reclaim isn't contending with threads
waiting for the key cache to be helped, and fixes a severe performance
bug.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

7254555c

lib/generic-radix-tree.c: Fix rare race in __genradix_ptr_alloc() · b2f11c6f

Kent Overstreet authored Aug 10, 2024

If we need to increase the tree depth, allocate a new node, and then
race with another thread that increased the tree depth before us, we'll
still have a preallocated node that might be used later.

If we then use that node for a new non-root node, it'll still have a
pointer to the old root instead of being zeroed - fix this by zeroing it
in the cmpxchg failure path.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

b2f11c6f

bcachefs: Convert for_each_btree_node() to lockrestart_do() · 968feb85

Kent Overstreet authored Aug 07, 2024

for_each_btree_node() now works similarly to for_each_btree_key(), where
the loop body is passed as an argument to be passed to lockrestart_do().

This now calls trans_begin() on every loop iteration - which fixes an
SRCU warning in backpointers fsck.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

968feb85

bcachefs: Add missing downgrade table entry · 48d6cc1b
Kent Overstreet authored Aug 13, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
48d6cc1b
bcachefs: disk accounting: ignore unknown types · 486d9207
Kent Overstreet authored Aug 13, 2024
```
forward compat fix
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
486d9207
bcachefs: bch2_accounting_invalid() fixup · d9e61576
Kent Overstreet authored Aug 13, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
d9e61576

bcachefs: Fix bch2_trigger_alloc when upgrading from old versions · bd864bc2

Kent Overstreet authored Aug 12, 2024

bch2_trigger_alloc was assuming that the new key would always be newly
created and thus always an alloc_v4 key, but - not when called from
btree_gc.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

bd864bc2

bcachefs: delete faulty fastpath in bch2_btree_path_traverse_cached() · a24e6e71

Kent Overstreet authored Aug 13, 2024

bch2_btree_path_traverse_cached() was previously checking if it could
just relock the path, which is a common idiom in path traversal.

However, it was using btree_node_relock(), not btree_path_relock();
btree_path_relock() only succeeds if the path was in state
BTREE_ITER_NEED_RELOCK.

If the path was in state BTREE_ITER_NEED_TRAVERSE a full traversal is
needed; this led to a null ptr deref in
bch2_btree_path_traverse_cached().

And the short circuit check here isn't needed, since it was already done
in the main bch2_btree_path_traverse_one().
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

a24e6e71

09 Aug, 2024 3 commits

bcachefs: bcachefs_metadata_version_disk_accounting_v3 · 8a2491db

Kent Overstreet authored Aug 09, 2024

bcachefs_metadata_version_disk_accounting_v2 erroneously had padding
bytes in disk_accounting_key, which is a problem because we have to
guarantee that all unused bytes in disk_accounting_key are zeroed.

Fortunately 6.11 isn't out yet, so it's cheap to fix this by spinning a
new version.
Reported-by: Gabriel de Perthuis <g2p.code@gmail.com>
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

8a2491db

bcachefs: improve bch2_dev_usage_to_text() · 1a9e219d
Kent Overstreet authored Aug 08, 2024
```
Add a line for capacity
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
1a9e219d

bcachefs: bch2_accounting_invalid() · 077e4737

Kent Overstreet authored Aug 08, 2024

Implement bch2_accounting_invalid(); check for junk at the end, and
replicas accounting entries in particular need to be checked or we'll
pop asserts later.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

077e4737

08 Aug, 2024 4 commits

bcachefs: Switch to .get_inode_acl() · f39bae2e

Kent Overstreet authored Aug 07, 2024

.set_acl() requires a dentry, and if one isn't passed it marks the VFS
inode as not having an ACL.

This has been causing inodes with ACLs to have them "disappear" on
bcachefs filesystem, depending on which path those inodes get pulled
into the cache from.

Switching to .get_inode_acl(), like other local filesystems, fixes this.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

f39bae2e

bcachefs: Use bch2_wait_on_allocator() in btree node alloc path · 73dc1656
Kent Overstreet authored Aug 07, 2024
```
If the allocator gets stuck, we need to know why.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
73dc1656

bcachefs: Make allocator stuck timeout configurable, ratelimit messages · cecf7279

Kent Overstreet authored Aug 07, 2024

Limit these messages to once every 2 minutes to avoid spamming logs;
with multiple devices the output can be quite significant.

Also, up the default timeout to 30 seconds from 10 seconds.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

cecf7279

bcachefs: Add missing path_traverse() to btree_iter_next_node() · 6d496e02

Kent Overstreet authored Aug 07, 2024

This fixes a bug exposed by the next path - we pop an assert in
path_set_should_be_locked().
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

6d496e02

07 Aug, 2024 6 commits

bcachefs: ec should not allocate from ro devs · 2caca9fb

Kent Overstreet authored Aug 06, 2024

This fixes a device removal deadlock when using erasure coding.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

2caca9fb

bcachefs: Improved allocator debugging for ec · c1e44462

Kent Overstreet authored Aug 06, 2024

chasing down a device removal deadlock with erasure coding
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

c1e44462

bcachefs: Add missing bch2_trans_begin() call · 02026e89
Kent Overstreet authored Aug 06, 2024
```
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>
```
02026e89

bcachefs: Add a comment for bucket helper types · 90b211fa

Kent Overstreet authored Jul 30, 2024

We've had bugs in the past with incorrect integer conversions in disk
accounting code, which is why bucket helpers now always return s64s; add
a comment explaining this.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

90b211fa

bcachefs: Don't rely on implicit unsigned -> signed integer conversion · 7442b5cd

Kent Overstreet authored Jul 30, 2024

implicit integer conversion is a fertile source of bugs, and we really
would rather not have the min()/max() macros doing it implicitly.
bcachefs appears to be the only place in the kernel where this happens,
so let's fix it.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

7442b5cd

lockdep: Fix lockdep_set_notrack_class() for CONFIG_LOCK_STAT · ff9bf4b3

Kent Overstreet authored Jul 30, 2024

We won't find a contended lock if it's not being tracked.
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

ff9bf4b3

31 Jul, 2024 1 commit

bcachefs: Fix double free of ca->buckets_nouse · e61dd678

Kent Overstreet authored Jul 30, 2024

Reported-by: Dan Carpenter <dan.carpenter@linaro.org>
Fixes: ffcbec60 ("bcachefs: Kill opts.buckets_nouse")
Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev>

e61dd678

28 Jul, 2024 8 commits

Linux 6.11-rc1 · 8400291e
Linus Torvalds authored Jul 28, 2024

8400291e

Merge tag 'kbuild-fixes-v6.11' of... · a0c04bd5

Linus Torvalds authored Jul 28, 2024

Merge tag 'kbuild-fixes-v6.11' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild

Pull Kbuild fixes from Masahiro Yamada:

 - Fix RPM package build error caused by an incorrect locale setup

 - Mark modules.weakdep as ghost in RPM package

 - Fix the odd combination of -S and -c in stack protector scripts,
   which is an error with the latest Clang

* tag 'kbuild-fixes-v6.11' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild:
  kbuild: Fix '-S -c' in x86 stack protector scripts
  kbuild: rpm-pkg: ghost modules.weakdep file
  kbuild: rpm-pkg: Fix C locale setup

a0c04bd5

minmax: simplify and clarify min_t()/max_t() implementation · 017fa3e8

Linus Torvalds authored Jul 28, 2024

This simplifies the min_t() and max_t() macros by no longer making them
work in the context of a C constant expression.

That means that you can no longer use them for static initializers or
for array sizes in type definitions, but there were only a couple of
such uses, and all of them were converted (famous last words) to use
MIN_T/MAX_T instead.

Cc: David Laight <David.Laight@aculab.com>
Cc: Lorenzo Stoakes <lorenzo.stoakes@oracle.com>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

017fa3e8

minmax: add a few more MIN_T/MAX_T users · 4477b39c

Linus Torvalds authored Jul 28, 2024

Commit 3a7e02c0 ("minmax: avoid overly complicated constant
expressions in VM code") added the simpler MIN_T/MAX_T macros in order
to avoid some excessive expansion from the rather complicated regular
min/max macros.

The complexity of those macros stems from two issues:

 (a) trying to use them in situations that require a C constant
     expression (in static initializers and for array sizes)

 (b) the type sanity checking

and MIN_T/MAX_T avoids both of these issues.

Now, in the whole (long) discussion about all this, it was pointed out
that the whole type sanity checking is entirely unnecessary for
min_t/max_t which get a fixed type that the comparison is done in.

But that still leaves min_t/max_t unnecessarily complicated due to
worries about the C constant expression case.

However, it turns out that there really aren't very many cases that use
min_t/max_t for this, and we can just force-convert those.

This does exactly that.

Which in turn will then allow for much simpler implementations of
min_t()/max_t().  All the usual "macros in all upper case will evaluate
the arguments multiple times" rules apply.

We should do all the same things for the regular min/max() vs MIN/MAX()
cases, but that has the added complexity of various drivers defining
their own local versions of MIN/MAX, so that needs another level of
fixes first.

Link: https://lore.kernel.org/all/b47fad1d0cf8449886ad148f8c013dae@AcuMS.aculab.com/
Cc: David Laight <David.Laight@aculab.com>
Cc: Lorenzo Stoakes <lorenzo.stoakes@oracle.com>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

4477b39c

Merge tag 'ubifs-for-linus-6.11-rc1-take2' of... · 7e2d0ba7

Linus Torvalds authored Jul 28, 2024

Merge tag 'ubifs-for-linus-6.11-rc1-take2' of git://git.kernel.org/pub/scm/linux/kernel/git/rw/ubifs

Pull UBI and UBIFS updates from Richard Weinberger:

 - Many fixes for power-cut issues by Zhihao Cheng

 - Another ubiblock error path fix

 - ubiblock section mismatch fix

 - Misc fixes all over the place

* tag 'ubifs-for-linus-6.11-rc1-take2' of git://git.kernel.org/pub/scm/linux/kernel/git/rw/ubifs:
  ubi: Fix ubi_init() ubiblock_exit() section mismatch
  ubifs: add check for crypto_shash_tfm_digest
  ubifs: Fix inconsistent inode size when powercut happens during appendant writing
  ubi: block: fix null-pointer-dereference in ubiblock_create()
  ubifs: fix kernel-doc warnings
  ubifs: correct UBIFS_DFS_DIR_LEN macro definition and improve code clarity
  mtd: ubi: Restore missing cleanup on ubi_init() failure path
  ubifs: dbg_orphan_check: Fix missed key type checking
  ubifs: Fix unattached inode when powercut happens in creating
  ubifs: Fix space leak when powercut happens in linking tmpfile
  ubifs: Move ui->data initialization after initializing security
  ubifs: Fix adding orphan entry twice for the same inode
  ubifs: Remove insert_dead_orphan from replaying orphan process
  Revert "ubifs: ubifs_symlink: Fix memleak of inode->i_link in error path"
  ubifs: Don't add xattr inode into orphan area
  ubifs: Fix unattached xattr inode if powercut happens after deleting
  mtd: ubi: avoid expensive do_div() on 32-bit machines
  mtd: ubi: make ubi_class constant
  ubi: eba: properly rollback inside self_check_eba

7e2d0ba7

kbuild: Fix '-S -c' in x86 stack protector scripts · 3415b10a

Nathan Chancellor authored Jul 26, 2024

After a recent change in clang to stop consuming all instances of '-S'
and '-c' [1], the stack protector scripts break due to the kernel's use
of -Werror=unused-command-line-argument to catch cases where flags are
not being properly consumed by the compiler driver:

  $ echo | clang -o - -x c - -S -c -Werror=unused-command-line-argument
  clang: error: argument unused during compilation: '-c' [-Werror,-Wunused-command-line-argument]

This results in CONFIG_STACKPROTECTOR getting disabled because
CONFIG_CC_HAS_SANE_STACKPROTECTOR is no longer set.

'-c' and '-S' both instruct the compiler to stop at different stages of
the pipeline ('-S' after compiling, '-c' after assembling), so having
them present together in the same command makes little sense. In this
case, the test wants to stop before assembling because it is looking at
the textual assembly output of the compiler for either '%fs' or '%gs',
so remove '-c' from the list of arguments to resolve the error.

All versions of GCC continue to work after this change, along with
versions of clang that do or do not contain the change mentioned above.

Cc: stable@vger.kernel.org
Fixes: 4f7fd4d7 ("[PATCH] Add the -fstack-protector option to the CFLAGS")
Fixes: 60a5317f ("x86: implement x86_32 stack protector")
Link: https://github.com/llvm/llvm-project/commit/6461e537815f7fa68cef06842505353cf5600e9c [1]
Signed-off-by: Nathan Chancellor <nathan@kernel.org>
Signed-off-by: Masahiro Yamada <masahiroy@kernel.org>

3415b10a

ubi: Fix ubi_init() ubiblock_exit() section mismatch · 92a286e9

Richard Weinberger authored Jul 13, 2024

Since ubiblock_exit() is now called from an init function,
the __exit section no longer makes sense.

Cc: Ben Hutchings <bwh@kernel.org>
Reported-by: kernel test robot <lkp@intel.com>
Closes: https://lore.kernel.org/oe-kbuild-all/202407131403.wZJpd8n2-lkp@intel.com/Signed-off-by: Richard Weinberger <richard@nod.at>
Reviewed-by: Zhihao Cheng <chengzhihao1@huawei.com>

92a286e9

Merge tag 'v6.11-merge' of git://git.kernel.org/pub/scm/linux/kernel/git/lenb/linux · e172f1e9

Linus Torvalds authored Jul 28, 2024

Pull turbostat updates from Len Brown:

 - Enable turbostat extensions to add both perf and PMT (Intel
   Platform Monitoring Technology) counters via the cmdline

 - Demonstrate PMT access with built-in support for Meteor Lake's
   Die C6 counter

* tag 'v6.11-merge' of git://git.kernel.org/pub/scm/linux/kernel/git/lenb/linux:
  tools/power turbostat: version 2024.07.26
  tools/power turbostat: Include umask=%x in perf counter's config
  tools/power turbostat: Document PMT in turbostat.8
  tools/power turbostat: Add MTL's PMT DC6 builtin counter
  tools/power turbostat: Add early support for PMT counters
  tools/power turbostat: Add selftests for added perf counters
  tools/power turbostat: Add selftests for SMI, APERF and MPERF counters
  tools/power turbostat: Move verbose counter messages to level 2
  tools/power turbostat: Move debug prints from stdout to stderr
  tools/power turbostat: Fix typo in turbostat.8
  tools/power turbostat: Add perf added counter example to turbostat.8
  tools/power turbostat: Fix formatting in turbostat.8
  tools/power turbostat: Extend --add option with perf counters
  tools/power turbostat: Group SMI counter with APERF and MPERF
  tools/power turbostat: Add ZERO_ARRAY for zero initializing builtin array
  tools/power turbostat: Replace enum rapl_source and cstate_source with counter_source
  tools/power turbostat: Remove anonymous union from rapl_counter_info_t
  tools/power/turbostat: Switch to new Intel CPU model defines

e172f1e9