Commit 7b103151 authored by Linus Torvalds's avatar Linus Torvalds

Merge tag 'for-linus-20190614' of git://git.kernel.dk/linux-block

Pull block fixes from Jens Axboe:

 - Remove references to old schedulers for the scheduler switching and
   blkio controller documentation (Andreas)

 - Kill duplicate check for report zone for null_blk (Chaitanya)

 - Two bcache fixes (Coly)

 - Ensure that mq-deadline is selected if zoned block device is enabled,
   as we need that to support them (Damien)

 - Fix io_uring memory leak (Eric)

 - ps3vram fallout from LBDAF removal (Geert)

 - Redundant blk-mq debugfs debugfs_create return check cleanup (Greg)

 - Extend NOPLM quirk for ST1000LM024 drives (Hans)

 - Remove error path warning that can now trigger after the queue
   removal/addition fixes (Ming)

* tag 'for-linus-20190614' of git://git.kernel.dk/linux-block:
  block/ps3vram: Use %llu to format sector_t after LBDAF removal
  libata: Extend quirks for the ST1000LM024 drives with NOLPM quirk
  bcache: only set BCACHE_DEV_WB_RUNNING when cached device attached
  bcache: fix stack corruption by PRECEDING_KEY()
  blk-mq: remove WARN_ON(!q->elevator) from blk_mq_sched_free_requests
  blkio-controller.txt: Remove references to CFQ
  block/switching-sched.txt: Update to blk-mq schedulers
  null_blk: remove duplicate check for report zone
  blk-mq: no need to check return value of debugfs_create functions
  io_uring: fix memory leak of UNIX domain socket inode
  block: force select mq-deadline for zoned block devices
parents 5dcedf46 1d0c0651
...@@ -13,11 +13,9 @@ you can do so by typing: ...@@ -13,11 +13,9 @@ you can do so by typing:
# mount none /sys -t sysfs # mount none /sys -t sysfs
As of the Linux 2.6.10 kernel, it is now possible to change the It is possible to change the IO scheduler for a given block device on
IO scheduler for a given block device on the fly (thus making it possible, the fly to select one of mq-deadline, none, bfq, or kyber schedulers -
for instance, to set the CFQ scheduler for the system default, but which can improve that device's throughput.
set a specific device to use the deadline or noop schedulers - which
can improve that device's throughput).
To set a specific scheduler, simply do this: To set a specific scheduler, simply do this:
...@@ -30,8 +28,8 @@ The list of defined schedulers can be found by simply doing ...@@ -30,8 +28,8 @@ The list of defined schedulers can be found by simply doing
a "cat /sys/block/DEV/queue/scheduler" - the list of valid names a "cat /sys/block/DEV/queue/scheduler" - the list of valid names
will be displayed, with the currently selected scheduler in brackets: will be displayed, with the currently selected scheduler in brackets:
# cat /sys/block/hda/queue/scheduler # cat /sys/block/sda/queue/scheduler
noop deadline [cfq] [mq-deadline] kyber bfq none
# echo deadline > /sys/block/hda/queue/scheduler # echo none >/sys/block/sda/queue/scheduler
# cat /sys/block/hda/queue/scheduler # cat /sys/block/sda/queue/scheduler
noop [deadline] cfq [none] mq-deadline kyber bfq
...@@ -8,61 +8,13 @@ both at leaf nodes as well as at intermediate nodes in a storage hierarchy. ...@@ -8,61 +8,13 @@ both at leaf nodes as well as at intermediate nodes in a storage hierarchy.
Plan is to use the same cgroup based management interface for blkio controller Plan is to use the same cgroup based management interface for blkio controller
and based on user options switch IO policies in the background. and based on user options switch IO policies in the background.
Currently two IO control policies are implemented. First one is proportional One IO control policy is throttling policy which can be used to
weight time based division of disk policy. It is implemented in CFQ. Hence specify upper IO rate limits on devices. This policy is implemented in
this policy takes effect only on leaf nodes when CFQ is being used. The second generic block layer and can be used on leaf nodes as well as higher
one is throttling policy which can be used to specify upper IO rate limits level logical devices like device mapper.
on devices. This policy is implemented in generic block layer and can be
used on leaf nodes as well as higher level logical devices like device mapper.
HOWTO HOWTO
===== =====
Proportional Weight division of bandwidth
-----------------------------------------
You can do a very simple testing of running two dd threads in two different
cgroups. Here is what you can do.
- Enable Block IO controller
CONFIG_BLK_CGROUP=y
- Enable group scheduling in CFQ
CONFIG_CFQ_GROUP_IOSCHED=y
- Compile and boot into kernel and mount IO controller (blkio); see
cgroups.txt, Why are cgroups needed?.
mount -t tmpfs cgroup_root /sys/fs/cgroup
mkdir /sys/fs/cgroup/blkio
mount -t cgroup -o blkio none /sys/fs/cgroup/blkio
- Create two cgroups
mkdir -p /sys/fs/cgroup/blkio/test1/ /sys/fs/cgroup/blkio/test2
- Set weights of group test1 and test2
echo 1000 > /sys/fs/cgroup/blkio/test1/blkio.weight
echo 500 > /sys/fs/cgroup/blkio/test2/blkio.weight
- Create two same size files (say 512MB each) on same disk (file1, file2) and
launch two dd threads in different cgroup to read those files.
sync
echo 3 > /proc/sys/vm/drop_caches
dd if=/mnt/sdb/zerofile1 of=/dev/null &
echo $! > /sys/fs/cgroup/blkio/test1/tasks
cat /sys/fs/cgroup/blkio/test1/tasks
dd if=/mnt/sdb/zerofile2 of=/dev/null &
echo $! > /sys/fs/cgroup/blkio/test2/tasks
cat /sys/fs/cgroup/blkio/test2/tasks
- At macro level, first dd should finish first. To get more precise data, keep
on looking at (with the help of script), at blkio.disk_time and
blkio.disk_sectors files of both test1 and test2 groups. This will tell how
much disk time (in milliseconds), each group got and how many sectors each
group dispatched to the disk. We provide fairness in terms of disk time, so
ideally io.disk_time of cgroups should be in proportion to the weight.
Throttling/Upper Limit policy Throttling/Upper Limit policy
----------------------------- -----------------------------
- Enable Block IO controller - Enable Block IO controller
...@@ -94,7 +46,7 @@ Throttling/Upper Limit policy ...@@ -94,7 +46,7 @@ Throttling/Upper Limit policy
Hierarchical Cgroups Hierarchical Cgroups
==================== ====================
Both CFQ and throttling implement hierarchy support; however, Throttling implements hierarchy support; however,
throttling's hierarchy support is enabled iff "sane_behavior" is throttling's hierarchy support is enabled iff "sane_behavior" is
enabled from cgroup side, which currently is a development option and enabled from cgroup side, which currently is a development option and
not publicly available. not publicly available.
...@@ -107,9 +59,8 @@ If somebody created a hierarchy like as follows. ...@@ -107,9 +59,8 @@ If somebody created a hierarchy like as follows.
| |
test3 test3
CFQ by default and throttling with "sane_behavior" will handle the Throttling with "sane_behavior" will handle the
hierarchy correctly. For details on CFQ hierarchy support, refer to hierarchy correctly. For throttling, all limits apply
Documentation/block/cfq-iosched.txt. For throttling, all limits apply
to the whole subtree while all statistics are local to the IOs to the whole subtree while all statistics are local to the IOs
directly generated by tasks in that cgroup. directly generated by tasks in that cgroup.
...@@ -130,10 +81,6 @@ CONFIG_DEBUG_BLK_CGROUP ...@@ -130,10 +81,6 @@ CONFIG_DEBUG_BLK_CGROUP
- Debug help. Right now some additional stats file show up in cgroup - Debug help. Right now some additional stats file show up in cgroup
if this option is enabled. if this option is enabled.
CONFIG_CFQ_GROUP_IOSCHED
- Enables group scheduling in CFQ. Currently only 1 level of group
creation is allowed.
CONFIG_BLK_DEV_THROTTLING CONFIG_BLK_DEV_THROTTLING
- Enable block device throttling support in block layer. - Enable block device throttling support in block layer.
...@@ -344,32 +291,3 @@ Common files among various policies ...@@ -344,32 +291,3 @@ Common files among various policies
- blkio.reset_stats - blkio.reset_stats
- Writing an int to this file will result in resetting all the stats - Writing an int to this file will result in resetting all the stats
for that cgroup. for that cgroup.
CFQ sysfs tunable
=================
/sys/block/<disk>/queue/iosched/slice_idle
------------------------------------------
On a faster hardware CFQ can be slow, especially with sequential workload.
This happens because CFQ idles on a single queue and single queue might not
drive deeper request queue depths to keep the storage busy. In such scenarios
one can try setting slice_idle=0 and that would switch CFQ to IOPS
(IO operations per second) mode on NCQ supporting hardware.
That means CFQ will not idle between cfq queues of a cfq group and hence be
able to driver higher queue depth and achieve better throughput. That also
means that cfq provides fairness among groups in terms of IOPS and not in
terms of disk time.
/sys/block/<disk>/queue/iosched/group_idle
------------------------------------------
If one disables idling on individual cfq queues and cfq service trees by
setting slice_idle=0, group_idle kicks in. That means CFQ will still idle
on the group in an attempt to provide fairness among groups.
By default group_idle is same as slice_idle and does not do anything if
slice_idle is enabled.
One can experience an overall throughput drop if you have created multiple
groups and put applications in that group which are not driving enough
IO to keep disk busy. In that case set group_idle=0, and CFQ will not idle
on individual groups and throughput should improve.
...@@ -73,6 +73,7 @@ config BLK_DEV_INTEGRITY ...@@ -73,6 +73,7 @@ config BLK_DEV_INTEGRITY
config BLK_DEV_ZONED config BLK_DEV_ZONED
bool "Zoned block device support" bool "Zoned block device support"
select MQ_IOSCHED_DEADLINE
---help--- ---help---
Block layer zoned block device support. This option enables Block layer zoned block device support. This option enables
support for ZAC/ZBC host-managed and host-aware zoned block devices. support for ZAC/ZBC host-managed and host-aware zoned block devices.
......
...@@ -821,38 +821,28 @@ static const struct blk_mq_debugfs_attr blk_mq_debugfs_ctx_attrs[] = { ...@@ -821,38 +821,28 @@ static const struct blk_mq_debugfs_attr blk_mq_debugfs_ctx_attrs[] = {
{}, {},
}; };
static bool debugfs_create_files(struct dentry *parent, void *data, static void debugfs_create_files(struct dentry *parent, void *data,
const struct blk_mq_debugfs_attr *attr) const struct blk_mq_debugfs_attr *attr)
{ {
if (IS_ERR_OR_NULL(parent)) if (IS_ERR_OR_NULL(parent))
return false; return;
d_inode(parent)->i_private = data; d_inode(parent)->i_private = data;
for (; attr->name; attr++) { for (; attr->name; attr++)
if (!debugfs_create_file(attr->name, attr->mode, parent, debugfs_create_file(attr->name, attr->mode, parent,
(void *)attr, &blk_mq_debugfs_fops)) (void *)attr, &blk_mq_debugfs_fops);
return false;
}
return true;
} }
int blk_mq_debugfs_register(struct request_queue *q) void blk_mq_debugfs_register(struct request_queue *q)
{ {
struct blk_mq_hw_ctx *hctx; struct blk_mq_hw_ctx *hctx;
int i; int i;
if (!blk_debugfs_root)
return -ENOENT;
q->debugfs_dir = debugfs_create_dir(kobject_name(q->kobj.parent), q->debugfs_dir = debugfs_create_dir(kobject_name(q->kobj.parent),
blk_debugfs_root); blk_debugfs_root);
if (!q->debugfs_dir)
return -ENOMEM;
if (!debugfs_create_files(q->debugfs_dir, q, debugfs_create_files(q->debugfs_dir, q, blk_mq_debugfs_queue_attrs);
blk_mq_debugfs_queue_attrs))
goto err;
/* /*
* blk_mq_init_sched() attempted to do this already, but q->debugfs_dir * blk_mq_init_sched() attempted to do this already, but q->debugfs_dir
...@@ -864,11 +854,10 @@ int blk_mq_debugfs_register(struct request_queue *q) ...@@ -864,11 +854,10 @@ int blk_mq_debugfs_register(struct request_queue *q)
/* Similarly, blk_mq_init_hctx() couldn't do this previously. */ /* Similarly, blk_mq_init_hctx() couldn't do this previously. */
queue_for_each_hw_ctx(q, hctx, i) { queue_for_each_hw_ctx(q, hctx, i) {
if (!hctx->debugfs_dir && blk_mq_debugfs_register_hctx(q, hctx)) if (!hctx->debugfs_dir)
goto err; blk_mq_debugfs_register_hctx(q, hctx);
if (q->elevator && !hctx->sched_debugfs_dir && if (q->elevator && !hctx->sched_debugfs_dir)
blk_mq_debugfs_register_sched_hctx(q, hctx)) blk_mq_debugfs_register_sched_hctx(q, hctx);
goto err;
} }
if (q->rq_qos) { if (q->rq_qos) {
...@@ -879,12 +868,6 @@ int blk_mq_debugfs_register(struct request_queue *q) ...@@ -879,12 +868,6 @@ int blk_mq_debugfs_register(struct request_queue *q)
rqos = rqos->next; rqos = rqos->next;
} }
} }
return 0;
err:
blk_mq_debugfs_unregister(q);
return -ENOMEM;
} }
void blk_mq_debugfs_unregister(struct request_queue *q) void blk_mq_debugfs_unregister(struct request_queue *q)
...@@ -894,7 +877,7 @@ void blk_mq_debugfs_unregister(struct request_queue *q) ...@@ -894,7 +877,7 @@ void blk_mq_debugfs_unregister(struct request_queue *q)
q->debugfs_dir = NULL; q->debugfs_dir = NULL;
} }
static int blk_mq_debugfs_register_ctx(struct blk_mq_hw_ctx *hctx, static void blk_mq_debugfs_register_ctx(struct blk_mq_hw_ctx *hctx,
struct blk_mq_ctx *ctx) struct blk_mq_ctx *ctx)
{ {
struct dentry *ctx_dir; struct dentry *ctx_dir;
...@@ -902,44 +885,24 @@ static int blk_mq_debugfs_register_ctx(struct blk_mq_hw_ctx *hctx, ...@@ -902,44 +885,24 @@ static int blk_mq_debugfs_register_ctx(struct blk_mq_hw_ctx *hctx,
snprintf(name, sizeof(name), "cpu%u", ctx->cpu); snprintf(name, sizeof(name), "cpu%u", ctx->cpu);
ctx_dir = debugfs_create_dir(name, hctx->debugfs_dir); ctx_dir = debugfs_create_dir(name, hctx->debugfs_dir);
if (!ctx_dir)
return -ENOMEM;
if (!debugfs_create_files(ctx_dir, ctx, blk_mq_debugfs_ctx_attrs))
return -ENOMEM;
return 0; debugfs_create_files(ctx_dir, ctx, blk_mq_debugfs_ctx_attrs);
} }
int blk_mq_debugfs_register_hctx(struct request_queue *q, void blk_mq_debugfs_register_hctx(struct request_queue *q,
struct blk_mq_hw_ctx *hctx) struct blk_mq_hw_ctx *hctx)
{ {
struct blk_mq_ctx *ctx; struct blk_mq_ctx *ctx;
char name[20]; char name[20];
int i; int i;
if (!q->debugfs_dir)
return -ENOENT;
snprintf(name, sizeof(name), "hctx%u", hctx->queue_num); snprintf(name, sizeof(name), "hctx%u", hctx->queue_num);
hctx->debugfs_dir = debugfs_create_dir(name, q->debugfs_dir); hctx->debugfs_dir = debugfs_create_dir(name, q->debugfs_dir);
if (!hctx->debugfs_dir)
return -ENOMEM;
if (!debugfs_create_files(hctx->debugfs_dir, hctx,
blk_mq_debugfs_hctx_attrs))
goto err;
hctx_for_each_ctx(hctx, ctx, i) { debugfs_create_files(hctx->debugfs_dir, hctx, blk_mq_debugfs_hctx_attrs);
if (blk_mq_debugfs_register_ctx(hctx, ctx))
goto err;
}
return 0;
err: hctx_for_each_ctx(hctx, ctx, i)
blk_mq_debugfs_unregister_hctx(hctx); blk_mq_debugfs_register_ctx(hctx, ctx);
return -ENOMEM;
} }
void blk_mq_debugfs_unregister_hctx(struct blk_mq_hw_ctx *hctx) void blk_mq_debugfs_unregister_hctx(struct blk_mq_hw_ctx *hctx)
...@@ -949,17 +912,13 @@ void blk_mq_debugfs_unregister_hctx(struct blk_mq_hw_ctx *hctx) ...@@ -949,17 +912,13 @@ void blk_mq_debugfs_unregister_hctx(struct blk_mq_hw_ctx *hctx)
hctx->debugfs_dir = NULL; hctx->debugfs_dir = NULL;
} }
int blk_mq_debugfs_register_hctxs(struct request_queue *q) void blk_mq_debugfs_register_hctxs(struct request_queue *q)
{ {
struct blk_mq_hw_ctx *hctx; struct blk_mq_hw_ctx *hctx;
int i; int i;
queue_for_each_hw_ctx(q, hctx, i) { queue_for_each_hw_ctx(q, hctx, i)
if (blk_mq_debugfs_register_hctx(q, hctx)) blk_mq_debugfs_register_hctx(q, hctx);
return -ENOMEM;
}
return 0;
} }
void blk_mq_debugfs_unregister_hctxs(struct request_queue *q) void blk_mq_debugfs_unregister_hctxs(struct request_queue *q)
...@@ -971,29 +930,16 @@ void blk_mq_debugfs_unregister_hctxs(struct request_queue *q) ...@@ -971,29 +930,16 @@ void blk_mq_debugfs_unregister_hctxs(struct request_queue *q)
blk_mq_debugfs_unregister_hctx(hctx); blk_mq_debugfs_unregister_hctx(hctx);
} }
int blk_mq_debugfs_register_sched(struct request_queue *q) void blk_mq_debugfs_register_sched(struct request_queue *q)
{ {
struct elevator_type *e = q->elevator->type; struct elevator_type *e = q->elevator->type;
if (!q->debugfs_dir)
return -ENOENT;
if (!e->queue_debugfs_attrs) if (!e->queue_debugfs_attrs)
return 0; return;
q->sched_debugfs_dir = debugfs_create_dir("sched", q->debugfs_dir); q->sched_debugfs_dir = debugfs_create_dir("sched", q->debugfs_dir);
if (!q->sched_debugfs_dir)
return -ENOMEM;
if (!debugfs_create_files(q->sched_debugfs_dir, q,
e->queue_debugfs_attrs))
goto err;
return 0; debugfs_create_files(q->sched_debugfs_dir, q, e->queue_debugfs_attrs);
err:
blk_mq_debugfs_unregister_sched(q);
return -ENOMEM;
} }
void blk_mq_debugfs_unregister_sched(struct request_queue *q) void blk_mq_debugfs_unregister_sched(struct request_queue *q)
...@@ -1008,36 +954,22 @@ void blk_mq_debugfs_unregister_rqos(struct rq_qos *rqos) ...@@ -1008,36 +954,22 @@ void blk_mq_debugfs_unregister_rqos(struct rq_qos *rqos)
rqos->debugfs_dir = NULL; rqos->debugfs_dir = NULL;
} }
int blk_mq_debugfs_register_rqos(struct rq_qos *rqos) void blk_mq_debugfs_register_rqos(struct rq_qos *rqos)
{ {
struct request_queue *q = rqos->q; struct request_queue *q = rqos->q;
const char *dir_name = rq_qos_id_to_name(rqos->id); const char *dir_name = rq_qos_id_to_name(rqos->id);
if (!q->debugfs_dir)
return -ENOENT;
if (rqos->debugfs_dir || !rqos->ops->debugfs_attrs) if (rqos->debugfs_dir || !rqos->ops->debugfs_attrs)
return 0; return;
if (!q->rqos_debugfs_dir) { if (!q->rqos_debugfs_dir)
q->rqos_debugfs_dir = debugfs_create_dir("rqos", q->rqos_debugfs_dir = debugfs_create_dir("rqos",
q->debugfs_dir); q->debugfs_dir);
if (!q->rqos_debugfs_dir)
return -ENOMEM;
}
rqos->debugfs_dir = debugfs_create_dir(dir_name, rqos->debugfs_dir = debugfs_create_dir(dir_name,
rqos->q->rqos_debugfs_dir); rqos->q->rqos_debugfs_dir);
if (!rqos->debugfs_dir)
return -ENOMEM;
if (!debugfs_create_files(rqos->debugfs_dir, rqos, debugfs_create_files(rqos->debugfs_dir, rqos, rqos->ops->debugfs_attrs);
rqos->ops->debugfs_attrs))
goto err;
return 0;
err:
blk_mq_debugfs_unregister_rqos(rqos);
return -ENOMEM;
} }
void blk_mq_debugfs_unregister_queue_rqos(struct request_queue *q) void blk_mq_debugfs_unregister_queue_rqos(struct request_queue *q)
...@@ -1046,27 +978,18 @@ void blk_mq_debugfs_unregister_queue_rqos(struct request_queue *q) ...@@ -1046,27 +978,18 @@ void blk_mq_debugfs_unregister_queue_rqos(struct request_queue *q)
q->rqos_debugfs_dir = NULL; q->rqos_debugfs_dir = NULL;
} }
int blk_mq_debugfs_register_sched_hctx(struct request_queue *q, void blk_mq_debugfs_register_sched_hctx(struct request_queue *q,
struct blk_mq_hw_ctx *hctx) struct blk_mq_hw_ctx *hctx)
{ {
struct elevator_type *e = q->elevator->type; struct elevator_type *e = q->elevator->type;
if (!hctx->debugfs_dir)
return -ENOENT;
if (!e->hctx_debugfs_attrs) if (!e->hctx_debugfs_attrs)
return 0; return;
hctx->sched_debugfs_dir = debugfs_create_dir("sched", hctx->sched_debugfs_dir = debugfs_create_dir("sched",
hctx->debugfs_dir); hctx->debugfs_dir);
if (!hctx->sched_debugfs_dir) debugfs_create_files(hctx->sched_debugfs_dir, hctx,
return -ENOMEM; e->hctx_debugfs_attrs);
if (!debugfs_create_files(hctx->sched_debugfs_dir, hctx,
e->hctx_debugfs_attrs))
return -ENOMEM;
return 0;
} }
void blk_mq_debugfs_unregister_sched_hctx(struct blk_mq_hw_ctx *hctx) void blk_mq_debugfs_unregister_sched_hctx(struct blk_mq_hw_ctx *hctx)
......
...@@ -18,74 +18,68 @@ struct blk_mq_debugfs_attr { ...@@ -18,74 +18,68 @@ struct blk_mq_debugfs_attr {
int __blk_mq_debugfs_rq_show(struct seq_file *m, struct request *rq); int __blk_mq_debugfs_rq_show(struct seq_file *m, struct request *rq);
int blk_mq_debugfs_rq_show(struct seq_file *m, void *v); int blk_mq_debugfs_rq_show(struct seq_file *m, void *v);
int blk_mq_debugfs_register(struct request_queue *q); void blk_mq_debugfs_register(struct request_queue *q);
void blk_mq_debugfs_unregister(struct request_queue *q); void blk_mq_debugfs_unregister(struct request_queue *q);
int blk_mq_debugfs_register_hctx(struct request_queue *q, void blk_mq_debugfs_register_hctx(struct request_queue *q,
struct blk_mq_hw_ctx *hctx); struct blk_mq_hw_ctx *hctx);
void blk_mq_debugfs_unregister_hctx(struct blk_mq_hw_ctx *hctx); void blk_mq_debugfs_unregister_hctx(struct blk_mq_hw_ctx *hctx);
int blk_mq_debugfs_register_hctxs(struct request_queue *q); void blk_mq_debugfs_register_hctxs(struct request_queue *q);
void blk_mq_debugfs_unregister_hctxs(struct request_queue *q); void blk_mq_debugfs_unregister_hctxs(struct request_queue *q);
int blk_mq_debugfs_register_sched(struct request_queue *q); void blk_mq_debugfs_register_sched(struct request_queue *q);
void blk_mq_debugfs_unregister_sched(struct request_queue *q); void blk_mq_debugfs_unregister_sched(struct request_queue *q);
int blk_mq_debugfs_register_sched_hctx(struct request_queue *q, void blk_mq_debugfs_register_sched_hctx(struct request_queue *q,
struct blk_mq_hw_ctx *hctx); struct blk_mq_hw_ctx *hctx);
void blk_mq_debugfs_unregister_sched_hctx(struct blk_mq_hw_ctx *hctx); void blk_mq_debugfs_unregister_sched_hctx(struct blk_mq_hw_ctx *hctx);
int blk_mq_debugfs_register_rqos(struct rq_qos *rqos); void blk_mq_debugfs_register_rqos(struct rq_qos *rqos);
void blk_mq_debugfs_unregister_rqos(struct rq_qos *rqos); void blk_mq_debugfs_unregister_rqos(struct rq_qos *rqos);
void blk_mq_debugfs_unregister_queue_rqos(struct request_queue *q); void blk_mq_debugfs_unregister_queue_rqos(struct request_queue *q);
#else #else
static inline int blk_mq_debugfs_register(struct request_queue *q) static inline void blk_mq_debugfs_register(struct request_queue *q)
{ {
return 0;
} }
static inline void blk_mq_debugfs_unregister(struct request_queue *q) static inline void blk_mq_debugfs_unregister(struct request_queue *q)
{ {
} }
static inline int blk_mq_debugfs_register_hctx(struct request_queue *q, static inline void blk_mq_debugfs_register_hctx(struct request_queue *q,
struct blk_mq_hw_ctx *hctx) struct blk_mq_hw_ctx *hctx)
{ {
return 0;
} }
static inline void blk_mq_debugfs_unregister_hctx(struct blk_mq_hw_ctx *hctx) static inline void blk_mq_debugfs_unregister_hctx(struct blk_mq_hw_ctx *hctx)
{ {
} }
static inline int blk_mq_debugfs_register_hctxs(struct request_queue *q) static inline void blk_mq_debugfs_register_hctxs(struct request_queue *q)
{ {
return 0;
} }
static inline void blk_mq_debugfs_unregister_hctxs(struct request_queue *q) static inline void blk_mq_debugfs_unregister_hctxs(struct request_queue *q)
{ {
} }
static inline int blk_mq_debugfs_register_sched(struct request_queue *q) static inline void blk_mq_debugfs_register_sched(struct request_queue *q)
{ {
return 0;
} }
static inline void blk_mq_debugfs_unregister_sched(struct request_queue *q) static inline void blk_mq_debugfs_unregister_sched(struct request_queue *q)
{ {
} }
static inline int blk_mq_debugfs_register_sched_hctx(struct request_queue *q, static inline void blk_mq_debugfs_register_sched_hctx(struct request_queue *q,
struct blk_mq_hw_ctx *hctx) struct blk_mq_hw_ctx *hctx)
{ {
return 0;
} }
static inline void blk_mq_debugfs_unregister_sched_hctx(struct blk_mq_hw_ctx *hctx) static inline void blk_mq_debugfs_unregister_sched_hctx(struct blk_mq_hw_ctx *hctx)
{ {
} }
static inline int blk_mq_debugfs_register_rqos(struct rq_qos *rqos) static inline void blk_mq_debugfs_register_rqos(struct rq_qos *rqos)
{ {
return 0;
} }
static inline void blk_mq_debugfs_unregister_rqos(struct rq_qos *rqos) static inline void blk_mq_debugfs_unregister_rqos(struct rq_qos *rqos)
......
...@@ -555,7 +555,6 @@ void blk_mq_sched_free_requests(struct request_queue *q) ...@@ -555,7 +555,6 @@ void blk_mq_sched_free_requests(struct request_queue *q)
int i; int i;
lockdep_assert_held(&q->sysfs_lock); lockdep_assert_held(&q->sysfs_lock);
WARN_ON(!q->elevator);
queue_for_each_hw_ctx(q, hctx, i) { queue_for_each_hw_ctx(q, hctx, i) {
if (hctx->sched_tags) if (hctx->sched_tags)
......
...@@ -4460,9 +4460,12 @@ static const struct ata_blacklist_entry ata_device_blacklist [] = { ...@@ -4460,9 +4460,12 @@ static const struct ata_blacklist_entry ata_device_blacklist [] = {
{ "ST3320[68]13AS", "SD1[5-9]", ATA_HORKAGE_NONCQ | { "ST3320[68]13AS", "SD1[5-9]", ATA_HORKAGE_NONCQ |
ATA_HORKAGE_FIRMWARE_WARN }, ATA_HORKAGE_FIRMWARE_WARN },
/* drives which fail FPDMA_AA activation (some may freeze afterwards) */ /* drives which fail FPDMA_AA activation (some may freeze afterwards)
{ "ST1000LM024 HN-M101MBB", "2AR10001", ATA_HORKAGE_BROKEN_FPDMA_AA }, the ST disks also have LPM issues */
{ "ST1000LM024 HN-M101MBB", "2BA30001", ATA_HORKAGE_BROKEN_FPDMA_AA }, { "ST1000LM024 HN-M101MBB", "2AR10001", ATA_HORKAGE_BROKEN_FPDMA_AA |
ATA_HORKAGE_NOLPM, },
{ "ST1000LM024 HN-M101MBB", "2BA30001", ATA_HORKAGE_BROKEN_FPDMA_AA |
ATA_HORKAGE_NOLPM, },
{ "VB0250EAVER", "HPG7", ATA_HORKAGE_BROKEN_FPDMA_AA }, { "VB0250EAVER", "HPG7", ATA_HORKAGE_BROKEN_FPDMA_AA },
/* Blacklist entries taken from Silicon Image 3124/3132 /* Blacklist entries taken from Silicon Image 3124/3132
......
...@@ -74,10 +74,6 @@ int null_zone_report(struct gendisk *disk, sector_t sector, ...@@ -74,10 +74,6 @@ int null_zone_report(struct gendisk *disk, sector_t sector,
struct nullb_device *dev = nullb->dev; struct nullb_device *dev = nullb->dev;
unsigned int zno, nrz = 0; unsigned int zno, nrz = 0;
if (!dev->zoned)
/* Not a zoned null device */
return -EOPNOTSUPP;
zno = null_zone_no(dev, sector); zno = null_zone_no(dev, sector);
if (zno < dev->nr_zones) { if (zno < dev->nr_zones) {
nrz = min_t(unsigned int, *nr_zones, dev->nr_zones - zno); nrz = min_t(unsigned int, *nr_zones, dev->nr_zones - zno);
......
...@@ -767,7 +767,7 @@ static int ps3vram_probe(struct ps3_system_bus_device *dev) ...@@ -767,7 +767,7 @@ static int ps3vram_probe(struct ps3_system_bus_device *dev)
strlcpy(gendisk->disk_name, DEVICE_NAME, sizeof(gendisk->disk_name)); strlcpy(gendisk->disk_name, DEVICE_NAME, sizeof(gendisk->disk_name));
set_capacity(gendisk, priv->size >> 9); set_capacity(gendisk, priv->size >> 9);
dev_info(&dev->core, "%s: Using %lu MiB of GPU memory\n", dev_info(&dev->core, "%s: Using %llu MiB of GPU memory\n",
gendisk->disk_name, get_capacity(gendisk) >> 11); gendisk->disk_name, get_capacity(gendisk) >> 11);
device_add_disk(&dev->core, gendisk, NULL); device_add_disk(&dev->core, gendisk, NULL);
......
...@@ -887,12 +887,22 @@ unsigned int bch_btree_insert_key(struct btree_keys *b, struct bkey *k, ...@@ -887,12 +887,22 @@ unsigned int bch_btree_insert_key(struct btree_keys *b, struct bkey *k,
struct bset *i = bset_tree_last(b)->data; struct bset *i = bset_tree_last(b)->data;
struct bkey *m, *prev = NULL; struct bkey *m, *prev = NULL;
struct btree_iter iter; struct btree_iter iter;
struct bkey preceding_key_on_stack = ZERO_KEY;
struct bkey *preceding_key_p = &preceding_key_on_stack;
BUG_ON(b->ops->is_extents && !KEY_SIZE(k)); BUG_ON(b->ops->is_extents && !KEY_SIZE(k));
m = bch_btree_iter_init(b, &iter, b->ops->is_extents /*
? PRECEDING_KEY(&START_KEY(k)) * If k has preceding key, preceding_key_p will be set to address
: PRECEDING_KEY(k)); * of k's preceding key; otherwise preceding_key_p will be set
* to NULL inside preceding_key().
*/
if (b->ops->is_extents)
preceding_key(&START_KEY(k), &preceding_key_p);
else
preceding_key(k, &preceding_key_p);
m = bch_btree_iter_init(b, &iter, preceding_key_p);
if (b->ops->insert_fixup(b, k, &iter, replace_key)) if (b->ops->insert_fixup(b, k, &iter, replace_key))
return status; return status;
......
...@@ -434,20 +434,26 @@ static inline bool bch_cut_back(const struct bkey *where, struct bkey *k) ...@@ -434,20 +434,26 @@ static inline bool bch_cut_back(const struct bkey *where, struct bkey *k)
return __bch_cut_back(where, k); return __bch_cut_back(where, k);
} }
#define PRECEDING_KEY(_k) \ /*
({ \ * Pointer '*preceding_key_p' points to a memory object to store preceding
struct bkey *_ret = NULL; \ * key of k. If the preceding key does not exist, set '*preceding_key_p' to
\ * NULL. So the caller of preceding_key() needs to take care of memory
if (KEY_INODE(_k) || KEY_OFFSET(_k)) { \ * which '*preceding_key_p' pointed to before calling preceding_key().
_ret = &KEY(KEY_INODE(_k), KEY_OFFSET(_k), 0); \ * Currently the only caller of preceding_key() is bch_btree_insert_key(),
\ * and it points to an on-stack variable, so the memory release is handled
if (!_ret->low) \ * by stackframe itself.
_ret->high--; \ */
_ret->low--; \ static inline void preceding_key(struct bkey *k, struct bkey **preceding_key_p)
} \ {
\ if (KEY_INODE(k) || KEY_OFFSET(k)) {
_ret; \ (**preceding_key_p) = KEY(KEY_INODE(k), KEY_OFFSET(k), 0);
}) if (!(*preceding_key_p)->low)
(*preceding_key_p)->high--;
(*preceding_key_p)->low--;
} else {
(*preceding_key_p) = NULL;
}
}
static inline bool bch_ptr_invalid(struct btree_keys *b, const struct bkey *k) static inline bool bch_ptr_invalid(struct btree_keys *b, const struct bkey *k)
{ {
......
...@@ -431,8 +431,13 @@ STORE(bch_cached_dev) ...@@ -431,8 +431,13 @@ STORE(bch_cached_dev)
bch_writeback_queue(dc); bch_writeback_queue(dc);
} }
/*
* Only set BCACHE_DEV_WB_RUNNING when cached device attached to
* a cache set, otherwise it doesn't make sense.
*/
if (attr == &sysfs_writeback_percent) if (attr == &sysfs_writeback_percent)
if (!test_and_set_bit(BCACHE_DEV_WB_RUNNING, &dc->disk.flags)) if ((dc->disk.c != NULL) &&
(!test_and_set_bit(BCACHE_DEV_WB_RUNNING, &dc->disk.flags)))
schedule_delayed_work(&dc->writeback_rate_update, schedule_delayed_work(&dc->writeback_rate_update,
dc->writeback_rate_update_seconds * HZ); dc->writeback_rate_update_seconds * HZ);
......
...@@ -2777,8 +2777,10 @@ static void io_ring_ctx_free(struct io_ring_ctx *ctx) ...@@ -2777,8 +2777,10 @@ static void io_ring_ctx_free(struct io_ring_ctx *ctx)
io_eventfd_unregister(ctx); io_eventfd_unregister(ctx);
#if defined(CONFIG_UNIX) #if defined(CONFIG_UNIX)
if (ctx->ring_sock) if (ctx->ring_sock) {
ctx->ring_sock->file = NULL; /* so that iput() is called */
sock_release(ctx->ring_sock); sock_release(ctx->ring_sock);
}
#endif #endif
io_mem_free(ctx->sq_ring); io_mem_free(ctx->sq_ring);
......
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment