Commits · 3da127fa887e5187ede702b835770634d705f8b2 · nexedi / linux

10 Mar, 2011 40 commits

drbd: increase module count on /proc/drbd access · 3da127fa

Lars Ellenberg authored Nov 24, 2010

If someone holds /proc/drbd open, previously rmmod would
"succeed" in starting the unload, but then block on remove_proc_entry,
leading to a situation where the lsmod does not show drbd anymore,
but /proc/drbd being still there (but no longer accessible).

I'd rather have rmmod fail up front in this case.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

3da127fa

drbd: Removed 20 seconds upper bound for side-stepping · c507f46f

Philipp Reisner authored Nov 22, 2010

Given low-enough network bandwidth combined with a IO
pattern that hammers onto a single RS-extent, side-stepping
might be necessary for much longer times.

Changed the code to print a single informal message after
20 seconds, but it keeps on stepping aside forever.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

c507f46f

drbd: Becoming sync target may not happen out of < C_WF_REPORT_PARAMS · 1fc80cf3

Philipp Reisner authored Nov 22, 2010

This patch is acutally a necessary addendum to the patch
"fix for spurious full sync (becoming sync target looked like invalidate)"
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

1fc80cf3

drbd: Starting with protocol 96 we can allow app-IO while receiving the bitmap · 3719094e

Philipp Reisner authored Nov 10, 2010

* C_STARTING_SYNC_S, C_STARTING_SYNC_T In these states the bitmap gets
  written to disk. Locking out of app-IO is done by using the
  drbd_queue_bitmap_io() and drbd_bitmap_io() functions these days.
  It is no longer necessary to lock out app-IO based on the connection
  state.
  App-IO that may come in after the BITMAP_IO flag got cleared before the
  state transition to C_SYNC_(SOURCE|TARGET) does not get mirrored, sets
  a bit in the local bitmap, that is already set, therefore changes nothing.

* C_WF_BITMAP_S In this state we send updates (P_OUT_OF_SYNC packets).
  With that we make sure they have the same number of bits when going
  into the C_SYNC_(SOURCE|TARGET) connection state.

* C_UNCONNECTED: The receiver starts, no need to lock out IO.

* C_DISCONNECTING: in drbd_disconnect() we had a wait_event()
  to wait until ap_bio_cnt reaches 0. Removed that.

* C_TIMEOUT, C_BROKEN_PIPE, C_NETWORK_FAILURE
  C_PROTOCOL_ERROR, C_TEAR_DOWN: Same as C_DISCONNECTING

* C_WF_REPORT_PARAMS: IO still possible since that is still
  like C_WF_CONNECTION.

And we do not need to send barriers in C_WF_BITMAP_S connection state.

Allow concurrent accesses to the bitmap when receiving the bitmap.
Everything gets ORed anyways.

A drbd_free_tl_hash() is in after_state_chg_work(). At that point
all the work items of the last connections must have been processed.

Introduced a call to drbd_free_tl_hash() into drbd_free_mdev()
for paranoia reasons.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

3719094e

drbd: Improvements in sanitize_state() · ab17b68f

Philipp Reisner authored Nov 17, 2010

The relevant change is that the state change to C_FW_BITMAP_S should
implicitly change pdsk to C_CONSISTENT. (Think of it as C_OUTDATED, only
without the guarantee that the peer has the outdated written to its
meta data)

At that opportunity I restructured the switch statement so that it
gets evaluated every time. (Has declarative character)
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

ab17b68f

drbd: Fixed race condition in drbd_queue_bitmap_io · 22afd7ee

Philipp Reisner authored Nov 16, 2010

May only test for ap_bio_cnt == 0 under req_lock. It can increase
only under req_lock.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

22afd7ee

drbd: Fixed inc_ap_bio() · 8869d683

Philipp Reisner authored Nov 17, 2010

The condition must be checked after perpare_to_wait(). The old
implementaion could loose wakeup events. Never observed in real
life.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

8869d683

drbd: use test_and_set_bit() to decide if bm_io_work should be queued · 127b3178
Philipp Reisner authored Nov 16, 2010
```
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
```
127b3178

drbd: Begin to account BIO processing time before inc_ap_bio() · aeda1cd6

Philipp Reisner authored Nov 09, 2010

Since inc_ap_bio() might sleep already
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

aeda1cd6

drbd: Implemented side-stepping in drbd_res_begin_io() · f91ab628

Philipp Reisner authored Nov 09, 2010

Before:
  drbd_rs_begin_io() locked app-IO out of an RS extent, and
  waited then until all previous app-IO in that area finished.
  (But not only until the disk-IO was finished but until the
   barrier/epoch ack came in for that == round trip time latency ++)

After:
  As soon as a new app-IO waits wants to start new IO on that
  RS extent, drbd_rs_begin_io() steps aside (clearing the
  BME_NO_WRITES flag again). It retries after 100ms.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

f91ab628

drbd: Make some functions static · 9d77a5fe

Philipp Reisner authored Nov 07, 2010

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

9d77a5fe

drbd: Implemented priority inheritance for resync requests · e3555d85

Philipp Reisner authored Nov 07, 2010

We only issue resync requests if there is no significant application IO
going on. = Application IO has higher priority than resnyc IO.

If application IO can not be started because the resync process locked
an resync_lru entry, start the IO operations necessary to release the
lock ASAP.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

e3555d85

drbd: Do not cleanup resync LRU for the Ahead/Behind SyncSource/SyncTarget transitions · 59817f4f

Philipp Reisner authored Oct 29, 2010

This one should be replaced with moving this cleanup to the
'right' position.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

59817f4f

drbd: When proxy's buffer drained off go into regular resync mode · c4752ef1

Philipp Reisner authored Oct 27, 2010

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

c4752ef1

drbd: New packet for Ahead/Behind mode: P_OUT_OF_SYNC · 73a01a18

Philipp Reisner authored Oct 27, 2010

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

73a01a18

drbd: Implemented two new connection states Ahead/Behind · 67531718

Philipp Reisner authored Oct 27, 2010

In this connection mode, the ahead node no longer replicates
application IO. The behind's disk becomes out dated.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

67531718

drbd: New configuration parameters for dealing with network congestion · 422028b1

Philipp Reisner authored Oct 27, 2010

net {
    on_congestion {block|pull-ahead|disconnect};
    congestion-fill {sectors};
    congestion-extents {al-extents};
}
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

422028b1

drbd: Track the numbers of sectors in flight · 759fbdfb

Philipp Reisner authored Oct 26, 2010

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

759fbdfb

drbd: Renamed write_flags_to_bio() to wire_flags_to_bio() · 688593c5

Lars Ellenberg authored Nov 17, 2010

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

688593c5

drbd: restore compatibility with 32bit kernels · 4896e8c1

Lars Ellenberg authored Nov 11, 2010

With commit
drbd: further converge progress display of resync and online-verify
accidentally an u64/u64 div was introduced, causing an unresolvable
symbol __udivdi3 to be reference. Actually for that division, 32bit are
still suficient for now, so we can revert to unsigned long instead.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

4896e8c1

drbd: properly use max_hw_sectors to limit the our bio size · 1816a2b4

Lars Ellenberg authored Nov 11, 2010

To ease tracking of bios in some hash tables, we want it to
not cross certain boundaries (128k, used to be 32k).
We limit the maximum bio size using queue parameters.

Historically some defines and variables we use there have been named
max_segment_size, which was misguided. Rename them to max_bio_size,
and use [blk_]queue_max_hw_sectors where appropriate.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

1816a2b4

drbd: debug: limit nelink-broadcast of request on digest mismatch to 32k · 3129b1b9

Lars Ellenberg authored Nov 11, 2010

We used to be limited to 32k requests,
but have increased that limit to 128k now.

This part of the code can only deal with 32k,
it would scramble arbitrary pages for larger requests.

As it is used for debugging only anyways,
it is ok to simply truncate the dumped data here.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

3129b1b9

drbd: detect modification of in-flight buffers · 470be44a

Lars Ellenberg authored Nov 10, 2010

With data-integrity digest enabled, double-check on the sending side
for modifications by upper layers of buffers under write back,
so we can tell it appart from corruption on the "wire".
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

470be44a

drbd: further converge progress display of resync and online-verify · 5f9915bb

Lars Ellenberg authored Nov 09, 2010

Show progressbar and ETA always, with proc_details >= 1 also show the
current sector position for both resync and online-verify on both nodes.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

5f9915bb

drbd: fix potential wrap of 32bit oos:%lu display in /proc/drbd · 18edc0b9

Lars Ellenberg authored Nov 09, 2010

When converting bits (4k resolution, still) to kB, we shift left. If it
was a large number of bits on a 32bit box (>= 4 TiB storage), we may
wrap the 32bit unsigned long base type, resulting in incorrect display.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

18edc0b9

drbd: use the resync controller for online-verify requests as well · 2649f080

Lars Ellenberg authored Nov 05, 2010

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

2649f080

drbd: factor out drbd_rs_number_requests · e65f440d

Lars Ellenberg authored Nov 05, 2010

Preparation patch to be able to use the auto-throttling resync controller
for online-verify requests as well.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

e65f440d

drbd: factor out drbd_rs_controller_reset · 9bd28d3c

Lars Ellenberg authored Nov 05, 2010

Preparation patch to be able to use the auto-throttling resync controller
for online-verify requests as well.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

9bd28d3c

drbd: show progress bar and ETA for online-verify · 439d5953

Lars Ellenberg authored Nov 05, 2010

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

439d5953

drbd: advance progress step marks for online-verify · ea5442af

Lars Ellenberg authored Nov 05, 2010

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

ea5442af

drbd: factor out advancement of resync marks for progress reporting · c6ea14df

Lars Ellenberg authored Nov 05, 2010

This is in preparation to unify progress reporting of
online-verify and resync requests.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

c6ea14df

drbd: initialize online-verify progress tracking on verify target · de228bba

Lars Ellenberg authored Nov 05, 2010

For partial (resumed) online verify, initialize the resync step marks
once we know what the online verify start sector is.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

de228bba

drbd: improve online-verify progress tracking · 30b743a2

Lars Ellenberg authored Nov 05, 2010

For a partial (resumed) online-verify, initialize rs_total not to total
bits, but to number of bits to check in this run, to match the meaning
rs_total has for actual resync.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

30b743a2

drbd: only reset online-verify start sector if verify completed · 26525618

Lars Ellenberg authored Nov 05, 2010

For network hickups during online-verify, on the next verify
triggered, we by default want to resume where it left off.

After any replication link interruption, there will be a (possibly
empty) resync.  Do not reset online-verify start sector if some resync
completed, that would defeats the purpose.

Only reset the start sector once a verify run is completed.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

26525618

Merge branch 'for-2.6.39/stack-plug' into for-2.6.39/core · 4c63f564

Jens Axboe authored Mar 10, 2011

Conflicts:
	block/blk-core.c
	block/blk-flush.c
	drivers/md/raid1.c
	drivers/md/raid10.c
	drivers/md/raid5.c
	fs/nilfs2/btnode.c
	fs/nilfs2/mdt.c
Signed-off-by: Jens Axboe <jaxboe@fusionio.com>

4c63f564

blk-throttle: Use blk_plug in throttle dispatch · 69d60eb9

Vivek Goyal authored Mar 09, 2011

Use plug in throttle dispatch also as we are dispatching a bunch of
bios in throttle context and some of them might merge.
Signed-off-by: Vivek Goyal <vgoyal@redhat.com>
Signed-off-by: Jens Axboe <jaxboe@fusionio.com>

69d60eb9

block: kill off REQ_UNPLUG · 721a9602

Jens Axboe authored Mar 09, 2011

With the plugging now being explicitly controlled by the
submitter, callers need not pass down unplugging hints
to the block layer. If they want to unplug, it's because they
manually plugged on their own - in which case, they should just
unplug at will.
Signed-off-by: Jens Axboe <jaxboe@fusionio.com>

721a9602

aio: remove request submission batching · cf15900e

Jens Axboe authored Mar 02, 2011

This should be useless now that we have on-stack plugging. So lets just
kill it.
Signed-off-by: Jens Axboe <jaxboe@fusionio.com>

cf15900e

fs: make aio plug · 9f5b9425

Shaohua Li authored Jul 01, 2010

Signed-off-by: Shaohua Li <shaohua.li@intel.com>
Signed-off-by: Jens Axboe <jaxboe@fusionio.com>

9f5b9425

fs: make mpage read/write_pages() plug · 2ed1a6bc
Jens Axboe authored Jun 22, 2010
```
Signed-off-by: Jens Axboe <jaxboe@fusionio.com>
```
2ed1a6bc