Commits · 26fde4dfcbdcbbac394bb35de0c0f842de6972b5 · Kirill Smelkov / linux

13 Jul, 2017 40 commits

NFS: check for nfs_refresh_inode() errors in nfs_fhget() · 26fde4df

NeilBrown authored Jul 03, 2017

If an NFS server returns a filehandle that we have previously
seen, and reports a different type, then nfs_refresh_inode()
will log a warning and return an error.

nfs_fhget() does not check for this error and may return an
inode with a different type than the one that the server
reported.

This is likely to cause confusion, and is one way that
->open_context() could return a directory inode as discussed
in the previous patch.

So if nfs_refresh_inode() returns and error, return that error
from nfs_fhget() to avoid the confusion propagating.
Signed-off-by: NeilBrown <neilb@suse.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

26fde4df

NFS: guard against confused server in nfs_atomic_open() · eaa2b82c

NeilBrown authored Jul 03, 2017

A confused server could return a filehandle for an
NFSv4 OPEN request, which it previously returned for a directory.
So the inode returned by  ->open_context() in nfs_atomic_open()
could conceivably be a directory inode.

This has particular implications for the call to
nfs_file_set_open_context() in nfs_finish_open().
If that is called on a directory inode, then the nfs_open_context
that gets stored in the filp->private_data will be linked to
nfs_inode->open_files.

When the directory is closed, nfs_closedir() will (ultimately)
free the ->private_data, but not unlink it from nfs_inode->open_files
(because it doesn't expect an nfs_open_context there).

Subsequently the memory could get used for something else and eventually
if the ->open_files list is walked, the walker will fall off the end and
crash.

So: change nfs_finish_open() to only call nfs_file_set_open_context()
for regular-file inodes.

This failure mode has been seen in a production setting (unknown NFS
server implementation).  The kernel was v3.0 and the specific sequence
seen would not affect more recent kernels, but I think a risk is still
present, and caution is wise.
Signed-off-by: NeilBrown <neilb@suse.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

eaa2b82c

NFS: only invalidate dentrys that are clearly invalid. · cc89684c

NeilBrown authored Jul 05, 2017

Since commit bafc9b75 ("vfs: More precise tests in d_invalidate")
in v3.18, a return of '0' from ->d_revalidate() will cause the dentry
to be invalidated even if it has filesystems mounted on or it or on a
descendant.  The mounted filesystem is unmounted.

This means we need to be careful not to return 0 unless the directory
referred to truly is invalid.  So -ESTALE or -ENOENT should invalidate
the directory.  Other errors such a -EPERM or -ERESTARTSYS should be
returned from ->d_revalidate() so they are propagated to the caller.

A particular problem can be demonstrated by:

1/ mount an NFS filesystem using NFSv3 on /mnt
2/ mount any other filesystem on /mnt/foo
3/ ls /mnt/foo
4/ turn off network, or otherwise make the server unable to respond
5/ ls /mnt/foo &
6/ cat /proc/$!/stack # note that nfs_lookup_revalidate is in the call stack
7/ kill -9 $! # this results in -ERESTARTSYS being returned
8/ observe that /mnt/foo has been unmounted.

This patch changes nfs_lookup_revalidate() to only treat
  -ESTALE from nfs_lookup_verify_inode() and
  -ESTALE or -ENOENT from ->lookup()
as indicating an invalid inode.  Other errors are returned.

Also nfs_check_inode_attributes() is changed to return -ESTALE rather
than -EIO.  This is consistent with the error returned in similar
circumstances from nfs_update_inode().

As this bug allows any user to unmount a filesystem mounted on an NFS
filesystem, this fix is suitable for stable kernels.

Fixes: bafc9b75 ("vfs: More precise tests in d_invalidate")
Cc: stable@vger.kernel.org (v3.18+)
Signed-off-by: NeilBrown <neilb@suse.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

cc89684c

PNFS for stateid errors retry against MDS first · 22368ff1

Olga Kornievskaia authored Jun 23, 2017

Upon receiving a stateid error such as BAD_STATEID, the client
should retry the operation against the MDS before deciding to
do stateid recovery.

Previously, the code would initiate state recovery and it could
lead to a race in a state manager that could chose an incorrect
recovery method which would lead to the EIO failure for the
application.
Signed-off-by: Olga Kornievskaia <kolga@netapp.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

22368ff1

PNFS fix EACCESS on commit to DS handling · a0bc01e0

Olga Kornievskaia authored Jun 23, 2017

Commit fabbbee0 "PNFS fix fallback to MDS if got error on
commit to DS" moved the pnfs_set_lo_fail() to unhandled errors
which was not correct and lead to a kernel oops on umount.

Instead, fix the original EACCESS on commit to DS error by
getting the new layout and re-doing the IO.

Fixes: fabbbee0 ("PNFS fix fallback to MDS if got error on commit to DS")
Signed-off-by: Olga Kornievskaia <kolga@netapp.com>
Cc: stable@vger.kernel.org # v4.12
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

a0bc01e0

NFS: silence a uninitialized variable warning · 4cd1ec95

Dan Carpenter authored Jun 23, 2017

Static checkers have gotten clever enough to complain that "id_long" is
uninitialized on the failure path. It's harmless, but simple to fix.
Signed-off-by: Dan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

4cd1ec95

nfs: Fix fscache stat printing in nfs_show_stats() · ce85bd29

Tuo Chen Peng authored Jun 06, 2017

nfs_show_stats() was incorrectly reading statistics for bytes when printing that
for fsc. It caused files like /proc/self/mountstats to report incorrect fsc
statistics for NFS mounts.
Signed-off-by: Tuo Chen Peng <tpeng@nvidia.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

ce85bd29

NFS: Fix initialization of nfs_page_array->npages · 2eb3aea7

Benjamin Coddington authored Jun 09, 2017

Commit 8ef9b0b9 open-coded nfs_pgarray_set(), and left out the
initialization of the nfs_page_array's npages.  This mistake didn't show up
until testing with block layouts, and there shows that all pNFS reads
return -EIO.

Fixes: 8ef9b0b9 ("NFS: move nfs_pgarray_set() to open code")
Signed-off-by: Benjamin Coddington <bcodding@redhat.com>
Cc: stable@vger.kernel.org # 4.12
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

2eb3aea7

NFS: Fix commit policy for non-blocking calls to nfs_write_inode() · 1a4edf0f

Trond Myklebust authored Jun 20, 2017

Now that the writes will schedule a commit on their own, we don't
need nfs_write_inode() to schedule one if there are outstanding
writes, and we're being called in non-blocking mode.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

1a4edf0f

NFS: Ensure we commit after writeback is complete · 919e3bd9

Trond Myklebust authored Jun 20, 2017

If the page cache is being flushed, then we want to ensure that we
do start a commit once the pages are done being flushed.
If we just wait until all I/O is done to that file, we can end up
livelocking until the balance_dirty_pages() mechanism puts its
foot down and forces I/O to stop.
So instead we do more or less the same thing that O_DIRECT does,
and set up a counter to tell us when the flush is done,
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

919e3bd9

NFS: Remove unused fields in the page I/O structures · b5973a8c

Trond Myklebust authored Jun 20, 2017

Remove the 'layout_private' fields that were only used by the pNFS OSD
layout driver.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

b5973a8c

SUNRPC: Make slot allocation more reliable · 92ea011f

Trond Myklebust authored Jun 20, 2017

In xprt_alloc_slot(), the spin lock is only needed to provide atomicity
between the atomic_add_unless() failure and the call to xprt_add_backlog().
We do not actually need to hold it across the memory allocation itself.

By dropping the lock, we can use a more resilient GFP_NOFS allocation,
just as we now do in the rest of the RPC client code.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

92ea011f

NFS: nfs_rename() - revalidate directories on -ERESTARTSYS · 818a8dbe

Benjamin Coddington authored Jun 16, 2017

An interrupted rename will leave the old dentry behind if the rename
succeeds.  Fix this by forcing a lookup the next time through
->d_revalidate.

A previous attempt at solving this problem took the approach to complete
the work of the rename asynchronously, however that approach was wrong
since it would allow the d_move() to occur after the directory's i_mutex
had been dropped by the original process.
Signed-off-by: Benjamin Coddington <bcodding@redhat.com>
Reviewed-by: Jeff Layton <jlayton@redhat.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

818a8dbe

NFS: convert flags to bool · a7a3b1e9

Benjamin Coddington authored Jun 20, 2017

NFS uses some int, and unsigned int :1, and bool as flags in structs and
args.  Assert the preference for uniformly replacing these with the bool
type.
Signed-off-by: Benjamin Coddington <bcodding@redhat.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

a7a3b1e9

NFS: Set FATTR4_WORD0_TYPE for . and .. entries · 18fe6a23

Anna Schumaker authored Jun 16, 2017

The current code worked okay for getdents(), but getdents64() expects
the d_type field to get filled out properly in the stat structure.
Setting this field fixes xfstests generic/401.
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

18fe6a23

nfsd4: const-ify nfsd4_ops · 800222f8

Christoph Hellwig authored May 08, 2017

nfsd4_ops contains function pointers, and marking it as constant avoids
it being able to be used as an attach vector for code injections.
Signed-off-by: Christoph Hellwig <hch@lst.de>

800222f8

sunrpc: mark all struct svc_version instances as const · aa8217d5

Christoph Hellwig authored May 12, 2017

Signed-off-by: Christoph Hellwig <hch@lst.de>
Acked-by: Trond Myklebust <trond.myklebust@primarydata.com>

aa8217d5

sunrpc: mark all struct svc_procinfo instances as const · b9c744c1

Christoph Hellwig authored May 12, 2017

struct svc_procinfo contains function pointers, and marking it as
constant avoids it being able to be used as an attach vector for
code injections.
Signed-off-by: Christoph Hellwig <hch@lst.de>

b9c744c1

sunrpc: move pc_count out of struct svc_procinfo · 0becc118

Christoph Hellwig authored May 08, 2017

pc_count is the only writeable memeber of struct svc_procinfo, which is
a good candidate to be const-ified as it contains function pointers.

This patch moves it into out out struct svc_procinfo, and into a
separate writable array that is pointed to by struct svc_version.
Signed-off-by: Christoph Hellwig <hch@lst.de>

0becc118

nfsd4: properly type op_func callbacks · 72edc37a

Christoph Hellwig authored May 08, 2017

Pass union nfsd4_op_u to the op_func callbacks instead of using unsafe
function pointer casts.

It also adds two missing structures to struct nfsd4_op.u to facilitate
this.
Signed-off-by: Christoph Hellwig <hch@lst.de>

72edc37a

nfsd4: remove nfsd4op_rsize · 62bbf8bb

Christoph Hellwig authored May 08, 2017

Except for a lot of unnecessary casts this typedef only has one user,
so remove the casts and expand it in struct nfsd4_operation.
Signed-off-by: Christoph Hellwig <hch@lst.de>

62bbf8bb

nfsd4: properly type op_get_currentstateid callbacks · c2a1102a

Christoph Hellwig authored May 08, 2017

Pass union nfsd4_op_u to the op_set_currentstateid callbacks instead of
using unsafe function pointer casts.
Signed-off-by: Christoph Hellwig <hch@lst.de>

c2a1102a

nfsd4: properly type op_set_currentstateid callbacks · 6c9600a7

Christoph Hellwig authored May 08, 2017

Given the args union in struct nfsd4_op a name, and pass it to the
op_set_currentstateid callbacks instead of using unsafe function
pointer casts.
Signed-off-by: Christoph Hellwig <hch@lst.de>

6c9600a7

sunrpc: remove kxdrproc_t · 408b3d46
Christoph Hellwig authored May 08, 2017
```
Remove the now unused typedef.
Signed-off-by: Christoph Hellwig <hch@lst.de>
```
408b3d46

sunrpc: properly type pc_encode callbacks · d16d1867

Christoph Hellwig authored May 08, 2017

Drop the resp argument as it can trivially be derived from the rqstp
argument.  With that all functions now have the same prototype, and we
can remove the unsafe casting to kxdrproc_t.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Acked-by: Trond Myklebust <trond.myklebust@primarydata.com>

d16d1867

sunrpc: properly type pc_decode callbacks · cc6acc20

Christoph Hellwig authored May 08, 2017

Drop the argp argument as it can trivially be derived from the rqstp
argument.  With that all functions now have the same prototype, and we
can remove the unsafe casting to kxdrproc_t.
Signed-off-by: Christoph Hellwig <hch@lst.de>

cc6acc20

sunrpc: properly type pc_release callbacks · 1150ded8

Christoph Hellwig authored May 08, 2017

Drop the p and resp arguments as they are always NULL or can trivially
be derived from the rqstp argument. With that all functions now have the
same prototype, and we can remove the unsafe casting to kxdrproc_t.
Signed-off-by: Christoph Hellwig <hch@lst.de>

1150ded8

sunrpc: properly type pc_func callbacks · 1c8a5409

Christoph Hellwig authored May 08, 2017

Drop the argp and resp arguments as they can trivially be derived from
the rqstp argument.  With that all functions now have the same prototype,
and we can remove the unsafe casting to svc_procfunc as well as the
svc_procfunc typedef itself.
Signed-off-by: Christoph Hellwig <hch@lst.de>

1c8a5409

nfsd: remove the unused PROC() macro in nfs3proc.c · 36ba89c2
Christoph Hellwig authored May 08, 2017
```
Signed-off-by: Christoph Hellwig <hch@lst.de>
```
36ba89c2
nfsd: use named initializers in PROC() · ec7e8cae
Christoph Hellwig authored May 08, 2017
```
Signed-off-by: Christoph Hellwig <hch@lst.de>
```
ec7e8cae
nfsd4: const-ify nfs_cb_version4 · 39d43f75
Christoph Hellwig authored May 12, 2017
```
Signed-off-by: Christoph Hellwig <hch@lst.de>
```
39d43f75

sunrpc: mark all struct rpc_procinfo instances as const · 511e936b

Christoph Hellwig authored May 12, 2017

struct rpc_procinfo contains function pointers, and marking it as
constant avoids it being able to be used as an attach vector for
code injections.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Acked-by: Trond Myklebust <trond.myklebust@primarydata.com>

511e936b

nfs: use ARRAY_SIZE() in the nfsacl_version3 declaration · 9ae7d8ff
Christoph Hellwig authored May 12, 2017
```
Signed-off-by: Christoph Hellwig <hch@lst.de>
```
9ae7d8ff

sunrpc: move p_count out of struct rpc_procinfo · c551858a

Christoph Hellwig authored May 08, 2017

p_count is the only writeable memeber of struct rpc_procinfo, which is
a good candidate to be const-ified as it contains function pointers.

This patch moves it into out out struct rpc_procinfo, and into a
separate writable array that is pointed to by struct rpc_version and
indexed by p_statidx.
Signed-off-by: Christoph Hellwig <hch@lst.de>

c551858a

lockd: fix some weird indentation · e91ff8e3

Christoph Hellwig authored May 08, 2017

Remove double indentation of a few struct rpc_version and
struct rpc_program instance.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Acked-by: Trond Myklebust <trond.myklebust@primarydata.com>

e91ff8e3

nfs: don't cast callback decode/proc/encode routines · 947c6e43

Christoph Hellwig authored May 11, 2017

Instead declare all functions with the proper methods signature.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Jeff Layton <jlayton@redhat.com>
Acked-by: Trond Myklebust <trond.myklebust@primarydata.com>

947c6e43

nfs: fix decoder callback prototypes · fc016483

Christoph Hellwig authored May 08, 2017

Declare the p_decode callbacks with the proper prototype instead of
casting to kxdrdproc_t and losing all type safety.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Jeff Layton <jlayton@redhat.com>
Acked-by: Trond Myklebust <trond.myklebust@primarydata.com>

fc016483

lockd: fix decoder callback prototypes · 04000564

Christoph Hellwig authored May 08, 2017

Declare the p_decode callbacks with the proper prototype instead of
casting to kxdrdproc_t and losing all type safety.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Jeff Layton <jlayton@redhat.com>
Acked-by: Trond Myklebust <trond.myklebust@primarydata.com>

04000564

nfsd: fix decoder callback prototypes · 5362a4ec

Christoph Hellwig authored May 08, 2017

Declare the p_decode callbacks with the proper prototype instead of
casting to kxdrdproc_t and losing all type safety.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Jeff Layton <jlayton@redhat.com>

5362a4ec

sunrpc/auth_gss: fix decoder callback prototypes · c56c620b

Christoph Hellwig authored May 08, 2017

Declare the p_decode callbacks with the proper prototype instead of
casting to kxdrdproc_t and losing all type safety.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Jeff Layton <jlayton@redhat.com>
Acked-by: Trond Myklebust <trond.myklebust@primarydata.com>

c56c620b