Commits · 3c5e5f6afd242ea5944197a9b54033c1461b793c · Kirill Smelkov / linux

03 Nov, 2012 9 commits

drbd: add forgotten spin_unlock · 3c5e5f6a

Lars Ellenberg authored Mar 15, 2011

somehow a "goto abort" was introduced with commit
  drbd: Extracted is_valid_transition() out of sanitize_state()
which left drbd_req_state still holding the spin lock.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

3c5e5f6a
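
A minimal user-space sketch of the bug class fixed in 3c5e5f6a: an early error exit that must not leave the lock held. It uses a pthread mutex in place of the kernel spin lock, and `req_lock`, `transition_is_valid()` and `change_state()` are hypothetical names, not the DRBD code. Routing every exit through one unlock label is only one common way to avoid the problem; the actual patch may have placed the unlock differently.

```c
#include <pthread.h>
#include <stdbool.h>

/* Hypothetical stand-ins; none of these names come from the DRBD source. */
static pthread_mutex_t req_lock = PTHREAD_MUTEX_INITIALIZER;
static int current_state;

static bool transition_is_valid(int from, int to)
{
    return to >= 0 && to != from;
}

/* Returns 0 on success, -1 if the transition is rejected. */
int change_state(int new_state)
{
    int rv = 0;

    pthread_mutex_lock(&req_lock);
    if (!transition_is_valid(current_state, new_state)) {
        rv = -1;
        goto out_unlock;    /* the error path still drops the lock */
    }
    current_state = new_state;

out_unlock:
    pthread_mutex_unlock(&req_lock);
    return rv;
}
```

If the rejected path returned without unlocking, the next call to `change_state()` would block forever on `req_lock`.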

drbd: bail out if a config request is over-determined, and not matching · 527f4b24

Lars Ellenberg authored Mar 14, 2011

We have resources resp. connections, volumes, and minor numbers.
A config request may specify all three of them.
If it turns out that the minor belongs to a different connection, or a
different volume number in the same connection, that configuration
request is invalid.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

527f4b24
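
The cross-check described in 527f4b24 could look roughly like the sketch below. The registry `minors[]`, the function `request_is_consistent()` and the "not specified" conventions are assumptions made for illustration; the real code works on netlink attributes and kernel objects.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical registry: which connection and volume each minor belongs to. */
struct minor_entry {
    int minor;
    const char *conn_name;
    int volume;
};

static const struct minor_entry minors[] = {
    { 0, "r0", 0 },
    { 1, "r0", 1 },
    { 2, "r1", 0 },
};

/*
 * conn_name == NULL, volume < 0 or minor < 0 mean "not specified".
 * Returns 1 if the request is consistent, 0 if it contradicts itself.
 */
int request_is_consistent(const char *conn_name, int volume, int minor)
{
    size_t i;

    if (minor < 0)
        return 1;    /* no minor given, nothing to cross-check */

    for (i = 0; i < sizeof(minors) / sizeof(minors[0]); i++) {
        if (minors[i].minor != minor)
            continue;
        if (conn_name && strcmp(conn_name, minors[i].conn_name) != 0)
            return 0;    /* minor belongs to a different connection */
        if (volume >= 0 && volume != minors[i].volume)
            return 0;    /* minor is registered under another volume number */
        return 1;
    }
    return 1;    /* unknown minor: nothing registered to contradict */
}
```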

drbd: new-connection and new-minor succeed, if the object already exists · 38f19616

Lars Ellenberg authored Mar 14, 2011

Follow O_CREAT semantics when creating connection or minor device/volume
objects. If we need O_CREAT|O_EXCL semantics some time down the road,
we can add NLM_F_EXCL to the netlink message flags.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

38f19616
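
A small sketch of the O_CREAT versus O_CREAT|O_EXCL distinction mentioned in 38f19616. `create_object()`, the `names` table and the flag `REQ_F_EXCL` are hypothetical; `REQ_F_EXCL` merely plays the role that NLM_F_EXCL would play in a netlink message.

```c
#include <errno.h>
#include <string.h>

#define MAX_OBJECTS 8
#define REQ_F_EXCL  0x1u    /* hypothetical flag standing in for NLM_F_EXCL */

static char names[MAX_OBJECTS][32];
static int n_objects;

/* Returns 0 if the object exists afterwards, or a negative errno value. */
int create_object(const char *name, unsigned int flags)
{
    int i;

    for (i = 0; i < n_objects; i++) {
        if (strcmp(names[i], name) == 0)
            /* O_CREAT semantics: already there is fine, unless exclusive */
            return (flags & REQ_F_EXCL) ? -EEXIST : 0;
    }
    if (n_objects == MAX_OBJECTS)
        return -ENOSPC;

    /* names[] is zero-initialised, so the copy stays NUL-terminated */
    strncpy(names[n_objects], name, sizeof(names[0]) - 1);
    n_objects++;
    return 0;
}
```

With these semantics a configuration tool can simply repeat a "new-connection" request without first checking whether the object exists.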

drbd: Allow a Diskless Secondary volume to be removed · cffec5b2

Lars Ellenberg authored Mar 10, 2011

Even if the connection is still established.
We should be able to remove a volume from a replication group,
without taking the whole group offline.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

cffec5b2

drbd: simplify conn_all_vols_unconf, make it bool · d0456c72

Lars Ellenberg authored Mar 10, 2011

Get rid of a temporary variable and a funny bitand assignment.
Just short circuit, returning false, once we encounter the first
still configured volume.

FIXME verify call sites for need of rcu_read_lock or stronger.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

d0456c72
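
The short-circuit form described in d0456c72 can be pictured as follows. This is a sketch over a plain array with a hypothetical `struct volume`; the real function iterates over the volumes of a connection.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, reduced volume descriptor. */
struct volume {
    bool configured;
};

/* True only if no volume is configured; stops at the first one that is. */
bool all_vols_unconf(const struct volume *vols, size_t n)
{
    size_t i;

    for (i = 0; i < n; i++) {
        if (vols[i].configured)
            return false;
    }
    return true;
}
```

Compared with accumulating a result via `&=` in a temporary, this returns as soon as the answer is known and needs no extra variable.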

drbd: drbd_adm_get_status needs to show some more detail · 543cc10b

Lars Ellenberg authored Mar 10, 2011

We want to see existing connection objects, even if they do not
currently have volumes attached.

Change the .dumpit variant of drbd_adm_get_status to iterate not over
minor devices, but over connections + volumes.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

543cc10b

drbd: remove now unused connector related files · 73d901b7

Lars Ellenberg authored Mar 07, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

73d901b7

drbd: allow holes in minor and volume id allocation · 8432b314

Lars Ellenberg authored Mar 08, 2011

s/idr_get_new/idr_get_new_above/
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

8432b314
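
idr_get_new_above() hands out the lowest free id at or above a caller-supplied starting point, which is what lets minor and volume numbers have gaps. The user-space sketch below imitates that behaviour with a small table; `alloc_id_above()` and `id_used[]` are hypothetical and not part of the idr API.

```c
#include <stdbool.h>

#define MAX_IDS 16

static bool id_used[MAX_IDS];

/* Returns the smallest free id >= start, or -1 if none is left. */
int alloc_id_above(int start)
{
    int id;

    if (start < 0)
        start = 0;
    for (id = start; id < MAX_IDS; id++) {
        if (!id_used[id]) {
            id_used[id] = true;
            return id;
        }
    }
    return -1;
}
```

Asking for id 5 on an empty table yields 5 and leaves 0 to 4 free, whereas an allocator that always starts at 0 could only produce dense numbering.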

drbd: switch configuration interface from connector to genetlink · 3b98c0c2

Lars Ellenberg authored Mar 07, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

3b98c0c2

14 Oct, 2011 31 commits

drbd: prepare the transition from connector to genetlink · ec2c35ac

Lars Ellenberg authored Mar 07, 2011

This adds the new API header and helper files.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

ec2c35ac

drbd: get rid of drbd_bcast_ee, it is of no use anymore · 3cb7a2a9

Lars Ellenberg authored Mar 07, 2011

This function was used to broadcast the (leading part of the)
bio payload in case we see a data integrity error.  It could be received
from userland with the drbdsetup events subcommand,
to have a peek into the payload that caused the checksum mismatch,
and guess from there what may have caused the mismatch,
mainly to guess whether it was modification of in-flight data,
or data corruption by broken hardware or software bugs.

Meanwhile we support bios that are larger than the maximum payload a
netlink datagram can carry.
And we have means to reliably detect modification of in-flight data by
calculating, and comparing, the checksum before and after sendmsg.
There is no need to carry this around anymore.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

3cb7a2a9

drbd: fix drbd_delete_device: remove vnr from volumes; idr_remove();... · 569083c0

Lars Ellenberg authored Mar 07, 2011

drbd: fix drbd_delete_device: remove vnr from volumes; idr_remove(); synchronize_rcu(); before cleanup

Still missing: rcu_read_lock() on the various call sites that
access/iterate over those idrs.

We don't need a specific write lock, as we only modify from
configuration context, which is already strictly serialized.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

569083c0

drbd: introduce a bio_set to allocate housekeeping bios from · da4a75d2

Lars Ellenberg authored Feb 23, 2011

Don't rely on availability of bios from the global fs_bio_set,
we should use our own bio_set for meta data IO.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

da4a75d2

drbd: use the newly introduced page pool for bitmap IO · 9db4e77f

Lars Ellenberg authored Feb 23, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

9db4e77f

drbd: add page pool to be used for meta data IO · 35abf594

Lars Ellenberg authored Feb 23, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

35abf594

drbd: only wakeup if something changed in update_peer_seq · 3c13b680

Lars Ellenberg authored Feb 23, 2011

This commit got it wrong:
    drbd: Make the peer_seq updating code more obvious

    Make it more clear that update_peer_seq() is supposed to wake up the
    seq_wait queue whenever the sequence number changes.

We don't need to wake up every time we receive a sequence number
that is _different_ from our currently stored "newest" sequence number,
but only if we receive a sequence number _newer_ than what we already
have, when we actually change mdev->peer_seq.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

3c13b680
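
The distinction drawn in 3c13b680 between "different" and "newer" can be sketched like this. `note_peer_seq()` and `seq_newer()` are hypothetical names, the wrap-around comparison via a signed 32-bit difference is an assumption about how "newer" is defined, and the boolean return value stands in for the actual wake-up of the wait queue.

```c
#include <stdbool.h>
#include <stdint.h>

static uint32_t peer_seq;    /* newest sequence number seen so far */

/* Wrap-safe "a is newer than b"; relies on two's complement conversion. */
static bool seq_newer(uint32_t a, uint32_t b)
{
    return (int32_t)(a - b) > 0;
}

/* Returns true only when peer_seq advanced, i.e. when waiters need a wake-up. */
bool note_peer_seq(uint32_t received)
{
    if (!seq_newer(received, peer_seq))
        return false;    /* same or older: nothing changed, no wake-up */
    peer_seq = received;
    return true;
}
```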

drbd: default to detach on-io-error · a5df0e19

Lars Ellenberg authored Feb 23, 2011

Old default behaviour was "pass-on",
which is not useful in production at all.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

a5df0e19

drbd: remove unused define · 2c4a48d0

Lars Ellenberg authored Feb 23, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

2c4a48d0

drbd: Replaced the minor_table array by an idr · 81a5d60e

Philipp Reisner authored Feb 22, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

81a5d60e

drbd: Implemented new commands to create/delete connections/minors · 774b3055

Philipp Reisner authored Feb 22, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

774b3055

drbd: Converted drbd_nl_(net_conf|disconnect)() from mdev to tconn · 80883197

Philipp Reisner authored Feb 18, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

80883197

drbd: Preparing the connector interface to operate on connections · 1aba4d7f

Philipp Reisner authored Feb 21, 2011

Up to now it only operated on minor numbers. Now it can also work
on named connections.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

1aba4d7f

drbd: Converted the transfer log from mdev to tconn · 2f5cdd0b

Philipp Reisner authored Feb 21, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

2f5cdd0b

drbd: Improved the dec_*() macros · 49559d87

Philipp Reisner authored Feb 21, 2011

Now those can be used with a struct drbd_conf * that has a name
other than 'mdev'.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

49559d87

drbd: Removed the mdev parameter from the ..to_tags() and ...from_tags() functions · 3f9cbe93

Philipp Reisner authored Feb 17, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

3f9cbe93

drbd: Reworked the unconfiguring and thread stopping code · 0e29d163

Philipp Reisner authored Feb 18, 2011

* Moved CONFIG_PENDING and DEVICE_DYING from mdev to tconn.
* Renamed drbd_reconfig_start() and drbd_reconfig_done() to
  conn_reconfig_start() and conn_reconfig_done().
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

0e29d163

drbd: Remove left-over function prototypes · c66342d9

Andreas Gruenbacher authored Mar 16, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

c66342d9

drbd: Replace get_asender_cmd() with its implementation · 7201b972

Andreas Gruenbacher authored Mar 14, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

7201b972

drbd: Get rid of P_MAX_CMD · 6e849ce8

Andreas Gruenbacher authored Mar 14, 2011

Instead of artificially enlarging the command decoding arrays to
P_MAX_CMD entries, check if an index is within the valid range using the
ARRAY_SIZE() macro.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

6e849ce8
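
The range check described in 6e849ce8 can be illustrated with a small dispatch table. The handlers and command numbers are made up for the example; only the ARRAY_SIZE() idiom itself is taken from the commit message.

```c
#include <stddef.h>

#define ARRAY_SIZE(a) (sizeof(a) / sizeof((a)[0]))

typedef int (*cmd_handler)(void);

/* Hypothetical handlers, returning distinct values so the result is visible. */
static int handle_ping(void) { return 1; }
static int handle_data(void) { return 2; }

/* The table is only as large as its highest initialised index requires. */
static const cmd_handler handlers[] = {
    [0] = handle_ping,
    [2] = handle_data,
};

/* Returns the handler's result, or -1 for unknown or unhandled commands. */
int dispatch(unsigned int cmd)
{
    if (cmd >= ARRAY_SIZE(handlers) || !handlers[cmd])
        return -1;
    return handlers[cmd]();
}
```

Because the bound is derived from the array itself, adding a handler cannot leave a stale maximum constant behind.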

drbd: Remove redundant check · 1b3bb47d

Andreas Gruenbacher authored Jan 28, 2011

Opening a device only succeeds on a primary node, or when explicitly
setting the allow_oos module parameter to allow opening the device
read-only on a secondary node. There is no other way that a request can
get into drbd_make_request(), so this code cannot trigger.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

1b3bb47d

drbd: Improve how conflicting writes are handled · 7be8da07

Andreas Gruenbacher authored Feb 22, 2011

The previous algorithm for dealing with overlapping concurrent writes
was generating unnecessary warnings for scenarios which could be
legitimate, and did not always handle partially overlapping requests
correctly.  Improve the algorithm as follows:

* While local or remote write requests are in progress, conflicting new
  local write requests will be delayed (commit 82172f7).

* When a conflict between a local and remote write request is detected,
  the node with the discard flag decides how to resolve the conflict: It
  will ask its peer to discard conflicting requests which are fully
  contained in the local request and retry requests which overlap only
  partially.  This involves a protocol change.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

7be8da07
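
The second rule in 7be8da07 turns on whether a peer request lies fully inside the local request or only overlaps it partially. The sketch below classifies two extents accordingly; `struct extent`, `enum conflict` and `classify()` are hypothetical, sizes are assumed to be in sectors, and the code says nothing about how the decision is communicated to the peer.

```c
#include <stdint.h>

enum conflict { NO_CONFLICT, DISCARD_PEER, RETRY_PEER };

/* Hypothetical request extent: start sector and length in sectors. */
struct extent {
    uint64_t sector;
    uint32_t size;
};

/* Decision as seen from the node holding the discard flag. */
enum conflict classify(const struct extent *local, const struct extent *peer)
{
    uint64_t l_end = local->sector + local->size;
    uint64_t p_end = peer->sector + peer->size;

    if (p_end <= local->sector || l_end <= peer->sector)
        return NO_CONFLICT;     /* extents do not overlap at all */
    if (peer->sector >= local->sector && p_end <= l_end)
        return DISCARD_PEER;    /* peer request fully contained in local one */
    return RETRY_PEER;          /* partial overlap: peer should retry */
}
```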

drbd: Use ping-timeout when waiting for missing ack packets · 71b1c1eb

Andreas Gruenbacher authored Mar 01, 2011

When the node with the discard flag resolves write conflicts in
dual-primary mode, it may determine that its peer has sent ack packets
on the metadata socket which have not arrived yet. Wait for the next ack
with ping-timeout instead of a hard-coded 30 seconds.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

71b1c1eb

drbd: Replace atomic_add_return with atomic_inc_return · 8ccf218e

Andreas Gruenbacher authored Feb 24, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

8ccf218e

drbd: Concurrent write detection fix · 206d3589

Andreas Gruenbacher authored Feb 26, 2011

Commit 9b1e63e changed the concurrent write detection algorithm to only insert
peer requests into write_requests tree after determining that there is no
conflict. With this change, new conflicting local requests could be added
while the algorithm runs, but this case was not handled correctly. Instead of
making the algorithm deal with this case, switch back to adding peer requests
to the write_requests tree immediately: this improves fairness.

When a peer request is discarded, remove that request from the write_requests
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

206d3589

drbd: Use container_of() instead of casting · 8050e6d0

Andreas Gruenbacher authored Feb 18, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

8050e6d0
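
Why container_of() is preferable to a cast, as a sketch: a cast from a member pointer to the enclosing struct is only correct when the member happens to be first, while container_of() subtracts the member's offset. The macro below is a simplified form without the kernel's type checking, and `struct request`, `struct work_item` and `work_to_request()` are hypothetical.

```c
#include <stddef.h>

/* Simplified form of the kernel macro, without its type checking. */
#define container_of(ptr, type, member) \
    ((type *)((char *)(ptr) - offsetof(type, member)))

struct work_item {
    int pending;
};

struct request {
    unsigned long sector;
    struct work_item work;    /* not the first member, so a cast would be wrong */
};

/* Recover the enclosing request from a pointer to its embedded work item. */
struct request *work_to_request(struct work_item *w)
{
    return container_of(w, struct request, work);
}
```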

drbd: fix a wrong likely(), updated comments · 9676c760

Lars Ellenberg authored Feb 22, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

9676c760

drbd: silence some log messages on bitmap IO · c9d963a4

Lars Ellenberg authored Feb 21, 2011

Summary log messages meant for global bitmap IO
should not be printed for bitmap IO caused by
activity log transactions.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

c9d963a4

drbd: new on-disk activity log transaction format · 7ad651b5

Lars Ellenberg authored Feb 21, 2011

Use a new on-disk transaction format for the activity log, which allows
for multiple changes to the active set per transaction.

Using 4k transaction blocks, we can now get rid of the work-around code
to deal with devices not supporting 512 byte logical block size.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

7ad651b5

lru_cache: allow multiple changes per transaction · 46a15bc3

Lars Ellenberg authored Feb 21, 2011

Allow multiple changes to the active set of elements in lru_cache.
The only current user of lru_cache, drbd, is driving this generalisation.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

46a15bc3

drbd: allow selecting specific bitmap pages for writeout · 45dfffeb

Lars Ellenberg authored Feb 21, 2011

We are about to allow several changes to the active set in one activity
log transaction. We have to write out the corresponding bitmap pages as
well, if changed.

Introduce drbd_bm_mark_for_writeout(), then re-use the existing bitmap
writeout path to submit all marked pages in one go.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

45dfffeb