Commits · cda79c545ead7e00b1adaf82a13fcea892bf1f43 · Kirill Smelkov / linux

17 Feb, 2017 23 commits

btrfs: remove unused parameter from read_block_for_search · cda79c54

David Sterba authored Feb 10, 2017

Never used in that function.
Reviewed-by: Liu Bo <bo.li.liu@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>

cda79c54

btrfs: ulist: rename ulist_fini to ulist_release · 6655bc3d

David Sterba authored Feb 15, 2017

Change the name so it matches the naming we already use eg. for
btrfs_path.
Suggested-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>

6655bc3d

btrfs: remove pointless rcu protection from btrfs_qgroup_inherit · 4ae8553c

David Sterba authored Feb 13, 2017

There was never need for RCU protection around reading nodesize or other
fairly constant filesystem data.
Signed-off-by: David Sterba <dsterba@suse.com>

4ae8553c

btrfs: qgroups: opencode qgroup_free helper · 0b08e1f4

David Sterba authored Feb 13, 2017

The helper name is not too helpful and is just wrapping a simple call.
Signed-off-by: David Sterba <dsterba@suse.com>

0b08e1f4

btrfs: remove unnecessary mutex lock in qgroup_account_snapshot · 9ea6e2b5

David Sterba authored Feb 13, 2017

The quota status used to be tracked as a variable, so the mutex was
needed (until "Btrfs: add a flags field to btrfs_fs_info" afcdd129).
Since the status is a bit modified atomically and we don't hold the
mutex beyond the check, we can drop it.
Signed-off-by: David Sterba <dsterba@suse.com>

9ea6e2b5

btrfs: check quota status earlier and don't do unnecessary frees · 81353d50

David Sterba authored Feb 13, 2017

Status of quotas should be the first check in
btrfs_qgroup_account_extent and we can return immediatelly, no need to
do no-op ulist frees.
Signed-off-by: David Sterba <dsterba@suse.com>

81353d50

btrfs: embed extent_changeset::range_changed to the structure · 53d32359

David Sterba authored Feb 13, 2017

We can embed range_changed to the extent changeset to address following
problems:

- no need to allocate ulist dynamically, we also get rid of the GFP_NOFS
  for free
- fix lack of allocation failure checking in btrfs_qgroup_reserve_data

The stack consuption where extent_changeset is used slightly increases:

before: 16
after: 16 - 8 (for pointer) + 32 (sizeof ulist) = 40

Which is bearable.
Reviewed-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>

53d32359

btrfs: ulist: make the finalization function public · 9d037933

David Sterba authored Feb 13, 2017

Make ulist_fini externally visible so the ulist API is complete.
Signed-off-by: David Sterba <dsterba@suse.com>

9d037933

btrfs: qgroups: make __del_qgroup_relation static · 025db916
David Sterba authored Feb 13, 2017
```
Internal helper.
Signed-off-by: David Sterba <dsterba@suse.com>
```
025db916

btrfs: make space cache inode readahead failure nonfatal · 1d480538

David Sterba authored Jan 23, 2017

We do a readahead of the free space cache inode to speed things up but
the failure is not fatal, like in other readahead cases. Proper reads
would need to happen anyway and any errors would be caught there.
Reviewed-by: Liu Bo <bo.li.liu@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>

1d480538

btrfs: use GFP_KERNEL in btrfs_add/del_qgroup_relation · 6602caf1

David Sterba authored Feb 13, 2017

Qgroup relations are added/deleted from ioctl, we hold the high level
qgroup lock, no deadlocks or recursion from the allocation possible
here.
Signed-off-by: David Sterba <dsterba@suse.com>

6602caf1

btrfs: use GFP_KERNEL in btrfs_quota_enable · 52bf8e7a

David Sterba authored Feb 13, 2017

We don't need to use GFP_NOFS here as this is called from ioctls an the
only lock held is the subvol_sem, which is of a high level and protects
creation/renames/deletion and is never held in the writeout paths.
Signed-off-by: David Sterba <dsterba@suse.com>

52bf8e7a

btrfs: use GFP_KERNEL in btrfs_read_qgroup_config · 323b88f4

David Sterba authored Feb 13, 2017

The qgroup config is read during mount, we do not have to use NOFS.
Signed-off-by: David Sterba <dsterba@suse.com>

323b88f4

btrfs: use GFP_KERNEL in create_snapshot · 23269bf5

David Sterba authored Feb 13, 2017

23269bf5

Btrfs: specify a new ordered extent type for create_io_em · 1af4a0aa

Liu Bo authored Feb 13, 2017

As 0 refers to an existing type BTRFS_ORDERED_IO_DONE, this specifies a
new type 'REGULAR' for regular IO.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

1af4a0aa

Btrfs: create a helper to create em for IO · 6f9994db

Liu Bo authored Jan 31, 2017

We have similar codes to create and insert extent mapping around IO path,
this merges them into a single helper.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

6f9994db

Btrfs: use helper to get used bytes of space_info · 4136135b

Liu Bo authored Feb 13, 2017

This uses a helper instead of open code around used byte of space_info
everywhere.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

4136135b

Btrfs: try to avoid acquiring free space ctl's lock · 0c9b36e0

Liu Bo authored Feb 13, 2017

We don't need to take the lock if the block group has not been cached.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

0c9b36e0

btrfs: Better csum error message for data csum mismatch · 6f6b643e

Qu Wenruo authored Feb 09, 2017

The original csum error message only outputs inode number, offset, check
sum and expected check sum.

However no root objectid is outputted, which sometimes makes debugging
quite painful under multi-subvolume case (including relocation).

Also the checksum output is decimal, which seldom makes sense for
users/developers and is hard to read in most time.

This patch will add root objectid, which will be %lld for rootid larger
than LAST_FREE_OBJECTID, and hex csum output for better readability.
Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

6f6b643e

Btrfs: add another missing end_page_writeback on submit_extent_page failure · fe01aa65

Takafumi Kubota authored Feb 09, 2017

If btrfs_bio_alloc fails in submit_extent_page, submit_extent_page returns
without clearing the writeback bit of the failed page.

__extent_writepage_io, that is a caller of submit_extent_page,
does not clear the remaining writeback bit anywhere.
As a result, this will cause the hang at filemap_fdatawait_range,
because it waits the writeback bit to be cleared from the failed page.
So, we have to call end_page_writeback to clear the writeback bit.

For reproducing the hang, we inject a fault like

   if (should_failtest()) { // I define should_failtest()
        bio = NULL;
   }
   else {
        bio = btrfs_bio_alloc(...);
   }

in submit_extent_page.

We should also check whether page has the bit before end_page_writeback,
to avoid the conflict against the other end_page_writeback in bio_endio.
Thus, we add PageWriteback checks not only in __extent_writepage_io,
but also in write_one_eb too, because it misses the check.
Signed-off-by: Takafumi Kubota <takafumi.kubota1012@sslab.ics.keio.ac.jp>
Reviewed-by: Liu Bo <bo.li.liu@oracle.com>
Cc: David Sterba <dsterba@suse.cz>
Signed-off-by: David Sterba <dsterba@suse.com>

fe01aa65

btrfs: remove unused ulist members · 66bbc1c0

David Sterba authored Feb 09, 2017

Commit "btrfs: ulist: Add ulist_del() function" (d4b80404)
removed some debugging code but left the structure defintions.
Reviewed-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Reviewed-by: Liu Bo <bo.li.liu@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>

66bbc1c0

Btrfs: use helper to simplify lock/unlock pages · 76c0021d

Liu Bo authored Feb 10, 2017

Since we have a helper to set page bits, let lock_delalloc_pages and
__unlock_for_delalloc use it.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

76c0021d

btrfs: teach __process_pages_contig about PAGE_LOCK operation · da2c7009

Liu Bo authored Feb 10, 2017

Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
[ changes to the helper separated from the following patch ]
Signed-off-by: David Sterba <dsterba@suse.com>

da2c7009

14 Feb, 2017 17 commits

Btrfs: create helper for processing bits on contiguous pages · 873695b3

Liu Bo authored Feb 02, 2017

This introduces a new helper which can be used to process pages bits.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

873695b3

Btrfs: kill trans in run_delalloc_nocow and btrfs_cross_ref_exist · e4c3b2dc

Liu Bo authored Jan 30, 2017

run_delalloc_nocow has used trans in two places where they don't
actually need @trans.

For btrfs_lookup_file_extent, we search for file extents without COWing
anything, and for btrfs_cross_ref_exist, the only place where we need
@trans is deferencing it in order to get running_transaction which we
could easily get from the global fs_info.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

e4c3b2dc

Btrfs: pass delayed_refs directly to btrfs_find_delayed_ref_head · f72ad18e

Liu Bo authored Jan 30, 2017

All we need is @delayed_refs, all callers have get it ahead of calling
btrfs_find_delayed_ref_head since lock needs to be acquired firstly,
there is no reason to deference it again inside the function.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

f72ad18e

Btrfs: remove unused trans in read_block_for_search · d07b8528

Liu Bo authored Jan 30, 2017

@trans is not used at all, this removes it.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

d07b8528

Btrfs: cleanup unused cached_state in __extent_writepage_io · bcf93489

Liu Bo authored Jan 25, 2017

@cached_state is no more required in __extent_writepage_io, also remove
the goto label.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>

bcf93489

btrfs: allow unlink to exceed subvolume quota · 003d7c59

Jeff Mahoney authored Jan 25, 2017

Once a qgroup limit is exceeded, it's impossible to restore normal
operation to the subvolume without modifying the limit or removing
the subvolume. This is a surprising situation for many users used
to the typical workflow with quotas on other file systems where it's
possible to remove files until the used space is back under the limit.

When we go to unlink a file and start the transaction, we'll hit
the qgroup limit while trying to reserve space for the items we'll
modify while removing the file. We discussed last month how best
to handle this situation and agreed that there is no perfect solution.
The best principle-of-least-surprise solution is to handle it similarly
to how we already handle ENOSPC when unlinking, which is to allow
the operation to succeed with the expectation that it will ultimately
release space under most circumstances.

This patch modifies the transaction start path to select whether to
honor the qgroups limits. btrfs_start_transaction_fallback_global_rsv
is the only caller that skips enforcement. The reservation and tracking
still happens normally -- it just skips the enforcement step.
Signed-off-by: Jeff Mahoney <jeffm@suse.com>
Reviewed-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>

003d7c59

Btrfs: fix wrong argument for btrfs_lookup_ordered_range · 9a9239ac

Liu Bo authored Jan 24, 2017

Commit Btrfs: btrfs_page_mkwrite: Reserve space in sectorsized units"
(d0b7da88) did this, but btrfs_lookup_ordered_range expects a 'length'
rather than a 'page_end'.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: Chandan Rajendra <chandan@linux.vnet.ibm.com>
Signed-off-by: David Sterba <dsterba@suse.com>

9a9239ac

btrfs: raid56: Remove unused variable in lock_stripe_add · a7ceffbb

Qu Wenruo authored Jan 16, 2017

Variable 'walk' in lock_stripe_add() is not used.  Remove it.
Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>

a7ceffbb

Btrfs: refactor btrfs_extent_same() slightly · fc4badd9

Omar Sandoval authored Jan 17, 2017

This was originally a prep patch for changing the behavior on len=0, but
we went another direction with that. This still makes the function
slightly easier to follow.
Reviewed-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Omar Sandoval <osandov@fb.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

fc4badd9

Btrfs: constify struct btrfs_{,disk_}key wherever possible · 310712b2

Omar Sandoval authored Jan 17, 2017

In a lot of places, it's unclear when it's safe to reuse a struct
btrfs_key after it has been passed to a helper function. Constify these
arguments wherever possible to make it obvious.
Signed-off-by: Omar Sandoval <osandov@fb.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>

310712b2

Btrfs: fix another race between truncate and lockless dio write · 4aaedfb0

Liu Bo authored Dec 14, 2016

Dio writes can update i_size in btrfs_get_blocks_direct when it
writes to offset beyond EOF so that endio can update disk_i_size
correctly (because we don't udpate disk_i_size beyond i_size).

However, when truncating down a file, we firstly update i_size
and then wait for in-flight lockless dio reads/writes, according
to the above, i_size may have been changed in dio writes, and
file extents don't get truncated.

For lockless dio writes are always overwrites, i_size is not
supposed to be changed, so this adds a check to filter out this
case.

The race could be reproduced by fstests/generic/299 with patch
"Btrfs: fix btrfs_ordered_update_i_size to update disk_i_size properly"
 applied.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>

4aaedfb0

Btrfs: clean up btrfs_ordered_update_i_size · 62c821a8

Liu Bo authored Dec 13, 2016

Since we have a good helper entry_end, use it for ordered extent.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
[ whitespace reformatting ]
Signed-off-by: David Sterba <dsterba@suse.com>

62c821a8

Btrfs: fix comment in btrfs_page_mkwrite · 5416034f

Liu Bo authored Dec 13, 2016

The comment about "page_mkwrite gets called every time the page is
dirtied" in btrfs_page_mkwrite is not correct, it only gets called the
first time the page gets dirtied after the page faults in.

However, we don't need to touch the code because it works well, although
the proper logic is to check if delalloc bits has been set and if so, go
free reserved space, if not, set the delalloc bits for dirty page range.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>

5416034f

Btrfs: fix btrfs_ordered_update_i_size to update disk_i_size properly · 19fd2df5

Liu Bo authored Dec 01, 2016

btrfs_ordered_update_i_size can be called by truncate and endio, but
only endio takes ordered_extent which contains the completed IO.

while truncating down a file, if there are some in-flight IOs,
btrfs_ordered_update_i_size in endio will set disk_i_size to
@orig_offset that is zero.  If truncating-down fails somehow, we try to
recover in memory isize with this zero'd disk_i_size.

Fix it by only updating disk_i_size with @orig_offset when
btrfs_ordered_update_i_size is not called from endio while truncating
down and waiting for in-flight IOs completing their work before recover
in-memory size.

Besides fixing the above issue, add an assertion for last_size to double
check we truncate down to the desired size.
Signed-off-by: Liu Bo <bo.li.liu@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>

19fd2df5

btrfs: fix over-80 lines introduced by previous cleanups · f85b7379

David Sterba authored Jan 20, 2017

This goes as a separate patch because fixing that inside the patches
caused too many many conflicts.
Signed-off-by: David Sterba <dsterba@suse.com>

f85b7379

btrfs: Make count_inode_refs take btrfs_inode · f329e319

Nikolay Borisov authored Jan 18, 2017

Signed-off-by: Nikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>

f329e319

btrfs: Make count_inode_extrefs take btrfs_inode · 36283658

Nikolay Borisov authored Jan 18, 2017

Signed-off-by: Nikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>

36283658