Commits · wc2 · Carlos Ramos Carreño / neoppod

05 May, 2023 1 commit

identification: Set possible answers to same msgid · b775379a

Levin Zimmermann authored Mar 31, 2023

This patchs origin is !20.
With recent changes on kirr/neo!2
and levin.zimmermann/wendelin.core@f546d836
we can already use WCFS with multiple master nodes in combination with this
patch. Until we proceed in !20
we can already use a NEO/py with this patch in order to test WCFS on our
wind clone.

---

Original patch content:

When a node tries to connect to another node it initially sends a
'RequestIdentification' packet. The other node can either reply with
'AcceptIdentification' or in case of a secondary master with
'NotPrimaryMaster'.

In the second case the message id differs from the initial requests
message id. This makes it difficult in a multi-threaded implementation
to proceed this answer: due to the different msg/connection - id the
multi-threaded implementation tries to proceed this incoming message in
a different thread, while the requesting thread waits forever for its
peers reply.

The most straightforward solution is to use the same connection - id for
both possible answers to the 'RequestIdentification' packet. This
doesn't break given NEO/py implementation and is only a small patch.
A workaround in a multi-threaded implementation on the other hand
seems to be much more complicated and time-consuming. Finally it also makes
sense semantically, because "Message IDs are used to identify response
packets" and in the given context the 'NotPrimaryMaster' *is* the de facto
response of a 'RequestIdentification'.

Because the msgid is the same now, '_ask' uses the
'PrimaryBootstrapHandler' (if it's the same connection) [1], so it may also
be necessary to move 'notPrimaryMaster' to 'PrimaryBootstrapHandler'.

[1] https://lab.nexedi.com/nexedi/neoppod/blob/ecc9c63c7/neo/lib/threaded_app.py#L142

b775379a

12 Apr, 2023 5 commits

. · 324dcb15
Kirill Smelkov authored Dec 09, 2020

324dcb15

fixup! Y client: Fix URI scheme to move credentials out of query · 7d1daf15

Kirill Smelkov authored Sep 08, 2021

parse_qsl no longer treats ';' as valid query separator for security
reason because most proxies did not do so and it was possible to poison
proxy cache due to difference in query separator handling (see bugs.python.org/issue42967).

To handle credentials we don't have any proxy here, and it is still
perfectly valid to use ';' as credentials separator.

-> Fix it with ';' -> '&' replace workaround, before feeding credentials
string to parse_qsl.

Amends: b9a42957.

7d1daf15

client: Fix URI scheme to move credentials out of query · 2bc2ba14

Kirill Smelkov authored Dec 09, 2020

- Move ca/cert/key out of query part into credentials part of URL
- As a consequence move cluster name out of credentials part into path part
- Allow both neo:// and neos:// schemes to be used.
  neo:// means always without SSL and neos:// means always with SSL.
  Even if neos:// URL comes without credentials embedded into it - they
  can be obtained from elsewhere.

I need this change in wendelin.core 2 which needs to take a zurl and
compute a mountpoint patch that corresponds to it, so that multiple
several zopes that use the same NEO storage, could be associated with
the same single WCFS instance.

2bc2ba14

Connection: Adjust msg_id a bit so it behaves like stream_id in HTTP/2 · ac3a4d13

Kirill Smelkov authored Dec 18, 2016

This is 2020 edition of my original patch from 2016 ( kirr/neo@dd3bb8b4 ).

It was described in my NEO/go article ( https://navytux.spb.ru/~kirr/neo.html )
in the text quoted below:

Then comes the link layer which provides service to exchange messages over
network. In current NEO/py every message has `msg_id` field, that similarly to
ZEO/py marks a request with serial number with requester then waiting for
corresponding answer to come back with the same message id. Even though there
might be several reply messages coming back to a single request, as e.g. NEO/py
asynchronous replication code[0], this approach is still similar to ZEO/py
remote procedure call (RPC) model because of single request semantic. One of
the places where this limitation shows is the same replicator code where
transactions metadata is fetched first with first series of RPC calls, and only
then object data is fetched with the second series of RPC calls. This could be
not very good e.g. in case when there is a lot of transactions/data to
synchronize, because 1) it puts assumption on, and so constraints, the storage
backend model on how data is stored (separate SQL tables for metadata and
data), and 2) no data will be synchronized at all until all transactions are
synchronized first. The second point prevents for example the syncing storage
in turn to provide, even if read-only, service for the already fetched data.
What would be maybe more useful is for requester to send request that it wants
to fetch ZODB data in `tid_min..tid_max` range and then the sender sending
intermixed stream of metadata/data in zodbdump-like format.

Keeping in mind this, and other examples, NEO/go shifts from thinking about
protocol logic as RPC to thinking of it as more general network protocol and
settles to provide general connection-oriented message exchange service[1] :
whenever a message with new `msg_id` is sent, a new connection is established
multiplexed on top of a single node-node TCP link. Then it is possible to
send/receive arbitrary messages over back and forth until so established
connection is closed. This works transparently to NEO/py who still thinks it
operates in simple RPC mode because of the way messages are put on the wire and
because simple RPC is subset of a general exchange. The `neonet` module also
provides `DialLink` and `ListenLink` primitives[2] that work similarly to
standard Go `net.Dial` and `net.Listen` but wrap so created link into the
multiplexing layer. What is actually done this way is very similar to HTTP/2
which also provides multiple general streams multiplexing on top of a single
TCP connection ([3], [4]). However if connection ids (sent in place of
`msg_id` on the wire) are assigned arbitrary, there could be a case when two
nodes could try to initiate two new different connections to each other with
the same connection id. To prevent such kind of conflict a simple rule to
allocate connection ids either even or odd, depending on the role peer played
while establishing the link, could be used. HTTP/2 takes similar approach[5]
where `"Streams initiated by a client MUST use odd-numbered stream identifiers;
those initiated by the server MUST use even-numbered stream identifiers."` with
NEO/go doing the same corresponding to who was originally dialer and who was a
listener. However it requires small patch to be applied on NEO/py side to
increment `msg_id` by 2 instead of 1.

NEO/py currently explicitly specifies `msg_id` for an answer in only limited
set of cases, by default assuming a reply comes to the last received message
whose `msg_id` it remembers globally per TCP-link. This approach is error-prone
and cannot generally work in cases where several simultaneous requests are
received over single link. This way NEO/go does not maintain any such global
per-link knowledge and handles every request by always explicitly using
corresponding connection object created at request reception time.

[0] https://lab.nexedi.com/kirr/neo/blob/463ef9ad/neo/storage/replicator.py
[1] https://lab.nexedi.com/kirr/neo/blob/463ef9ad/go/neo/neonet/connection.go
[2] https://lab.nexedi.com/kirr/neo/blob/463ef9ad/go/neo/neonet/newlink.go
[3] https://tools.ietf.org/html/rfc7540#section-5
[4] https://http2.github.io/faq/#why-is-http2-multiplexed
[5] https://tools.ietf.org/html/rfc7540#section-5.1.1

It can be criticized, but the fact is:

- it does no harm to NEO/py and is backward-compatible: a NEO/py node
without this patch can still successfully connect and interoperate to
another NEO/py node with this patch.

- it is required for NEO/go to be able to interoperate with NEO/py.
Both client and server parts of NEO/go use the same neonet module to exchange messages.

- NEO/go client is used by wendelin.core 2, which organizes access to on-ZODB
ZBigFile data via WCFS filesystem implemented in Go.

So on one side this patch is small, simple and does not do any harm to NEO/py.
On the other side it is required for NEO/go and wendelin.core 2.

To me this clearly indicates that there should be no good reason to reject
inclusion of this patch into NEO/py.

--------

My original patch from 2016 came with corresponding adjustments to neo/tests/testConnection.py
( kirr/neo@dd3bb8b4 )
but commit f6eb02b4 (Remove packet timeouts; 2017-05-04) removed testConnection.py
completely and, if I understand correctly, did not add any other test to
compensate that. This way I'm not trying to restore my tests to
Connection neither.

Anyway, with this patch there is no regression to all other existing NEO/py tests.

--------

My original patch description from 2016 follows:

- even for server initiated streams
- odd for client initiated streams

This way I will be able to use Pkt.msg_id as real stream_id in go's Conn
because with even / odd scheme there is no possibility for id conflicts
in between two peers.

ac3a4d13

Add branch for WC2 compatibility mode · 739096b7

Levin Zimmermann authored Apr 12, 2023

Between NEO/go and NEO/py there are various incompatibilities.
This branch aims to transparently communicate those incompatibilities.
We can also specify a SR in SlapOS which uses this branch in case
someone aims to use WC2.

Here we start from the old-proto (e.g. pre-messagepack) branch, because
in NEO/go msgpack support isn't fully implemented yet. This isn't
considered as an incompatibility but simply as a remaining TODO on
NEO/go side.

739096b7

11 May, 2021 2 commits
- Merge "mysql: workaround for MDEV-20693" · 5d348d4c
  Julien Muchembled authored May 22, 2020
  
  5d348d4c
- neoctl: fix tweak command when used without any argument · 0c8e3085
  Julien Muchembled authored May 11, 2021
```
(cherry picked from commit ba0bc779)
```
  0c8e3085
22 Mar, 2021 1 commit
- qa: renew certificates for tests · 787bd409
  Julien Muchembled authored Mar 22, 2021
```
(cherry picked from commit fa581be5)
```
  787bd409
19 Aug, 2020 3 commits

qa: skip broken ZODB test · a1418c9d

Julien Muchembled authored Jun 12, 2020

======================================================================
FAIL: check_tid_ordering_w_commit (neo.tests.zodb.testBasic.BasicTests)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "ZODB/tests/BasicStorage.py", line 397, in check_tid_ordering_w_commit
    self.assertEqual(results.pop('lastTransaction'), tids[1])
  File "neo/tests/__init__.py", line 301, in assertEqual
    return super(NeoTestBase, self).assertEqual(first, second, msg=msg)
failureException: '\x03\xd8\x85H\xbffp\xbb' != '\x03\xd8\x85H\xbfs\x0b\xdd'

(cherry picked from commit f4cb59d2)

a1418c9d

client: fix race with invalidations when starting a new transaction on ZODB 5 · 96a5c01f
Julien Muchembled authored Jun 05, 2020
```
This requires ZODB >= 5.6.0

(cherry picked from commit a7d101ec)
```
96a5c01f
Code clean-up, comment fixes · fa7fbad6
Julien Muchembled authored Feb 20, 2020
```
(cherry picked from commit 43029be2)
```
fa7fbad6

22 May, 2020 2 commits

master: fix crash in STARTING_BACKUP when connecting to an upstream secondary master · 011eba12

Julien Muchembled authored Apr 30, 2019

This fixes the following assertion:

  Traceback (most recent call last):
    File "neo/master/app.py", line 172, in run
      self._run()
    File "neo/master/app.py", line 182, in _run
      self.playPrimaryRole()
    File "neo/master/app.py", line 302, in playPrimaryRole
      self.backup_app.provideService())
    File "neo/master/backup_app.py", line 114, in provideService
      node, conn = bootstrap.getPrimaryConnection()
    File "neo/lib/bootstrap.py", line 74, in getPrimaryConnection
      poll(1)
    File "neo/lib/event.py", line 160, in poll
      to_process.process()
    File "neo/lib/connection.py", line 504, in process
      self._handlers.handle(self, self._queue.pop(0))
    File "neo/lib/connection.py", line 92, in handle
      self._handle(connection, packet)
    File "neo/lib/connection.py", line 107, in _handle
      pending[0][1].packetReceived(connection, packet)
    File "neo/lib/handler.py", line 125, in packetReceived
      self.dispatch(*args)
    File "neo/lib/handler.py", line 75, in dispatch
      method(conn, *args, **kw)
    File "neo/lib/handler.py", line 159, in notPrimaryMaster
      assert primary != self.app.server
  AttributeError: 'BackupApplication' object has no attribute 'server'

(cherry picked from commit dba07e72)

011eba12

mysql: workaround for MDEV-20693 · 70387981
Julien Muchembled authored Jan 15, 2020

70387981

07 Jan, 2020 1 commit
- Merge v1.12 · 2c823e2e
  Julien Muchembled authored Jan 07, 2020
  
  2c823e2e
28 Apr, 2019 1 commit
- Release version 1.12 · 6332112c
  Julien Muchembled authored Apr 28, 2019
  
  6332112c
27 Apr, 2019 12 commits

master: reject drop/tweak ctl commands that could lead to unwanted status · 55a6dd0f

Julien Muchembled authored Apr 11, 2019

The following 2 operations can be onerous and they should not be
directly usable without some kind of confirmation by the user:
- Dropping a node now requires to first stop it.
- Tweaking does not exclude anymore automatically DOWN nodes,
  because a node could go DOWN between the moment the user sends
  the command to tweak and the actual tweak by the master.

55a6dd0f

qa: extend test reproducing the migration of a big ZODB to NEO · ef4d58f6
Julien Muchembled authored Apr 07, 2019

ef4d58f6
neoctl: better display of full partition tables · ab082d7e
Julien Muchembled authored Apr 04, 2019

ab082d7e
Bump protocol version · c6453626
Julien Muchembled authored Apr 26, 2019

c6453626

tweak: add option to simulate · 2a27239d

Julien Muchembled authored Mar 31, 2019

Initially, I wanted to do the simulation inside neoctl but it has no knowledge
of the topology (the master don't send devpath values of storage nodes).
Therefore, the work is delegated to the master node, which implies a change
of the protocol.

2a27239d

tweak: do not crash when trying to remove all nodes · 3839d224
Julien Muchembled authored Apr 04, 2019

3839d224
tweak: do not touch cells of nodes that are intended to be dropped · 8a645d9f
Julien Muchembled authored Mar 29, 2019

8a645d9f

Better error reporting from the master to neoctl for denied requests · c2c9e99d

Julien Muchembled authored Apr 06, 2019

This stops abusing ProtocolError, which disconnects the admin node needlessly.

The many 'if ... raise RuntimeError' in neo/neoctl/neoctl.py
could be turned into assertions.

c2c9e99d

Make 'neoctl print pt' report the number of replicas · 21190ee7
Julien Muchembled authored Mar 31, 2019

21190ee7

Make the number of replicas modifiable when the cluster is running · ef5fc508

Julien Muchembled authored Mar 27, 2019

neoctl gets a new command to change the number of replicas.

The number of replicas becomes a new partition table attribute and
like the PT id, it is stored in the config table. On the other side,
the configuration value for the number of partitions is dropped,
since it can be computed from the partition table, which is
always stored in full.

The -p/-r master options now only apply at database creation.

Some implementation notes:

- The protocol is slightly optimized in that the master now sends
  automatically the whole partition tables to the admin & client
  nodes upon connection, like for storage nodes.
  This makes the protocol more consistent, and the master is the
  only remaining node requesting partition tables, during recovery.

- Some parts become tricky because app.pt can be None in more cases.
  For example, the extra condition in NodeManager.update
  (before app.pt.dropNode) was added for this is the reason.
  Or the 'loadPartitionTable' method (storage) that is not inlined
  because of unit tests.
  Overall, this commit simplifies more than it complicates.

- In the master handlers, we stop hijacking the 'connectionCompleted'
  method for tasks to be performed (often send the full partition
  table) on handler switches.

- The admin's 'bootstrapped' flag could have been removed earlier:
  race conditions can't happen since the AskNodeInformation packet
  was removed (commit d048a52d).

ef5fc508

New --new-nid storage option for fast cloning · 27e3f620

Julien Muchembled authored Mar 21, 2019

It is often faster to set up replicas by stopping a node (and any
underlying database server like MariaDB) and do a raw copy of the
database (e.g. with rsync). So far, it required to stop the whole
cluster and use tools like 'mysql' or sqlite3' to edit:
- the 'pt' table in databases,
- the 'config.nid' values of the new nodes.

With this new option, if you already have 1 replica, you can set up
new replicas with such fast raw copy, and without interruption of
service. Obviously, this implies less redundancy during the operation.

27e3f620

qa: fix 2 tests with ZODB5 · 64e02391
Julien Muchembled authored Apr 26, 2019

64e02391

26 Apr, 2019 4 commits
- qa: new tools/stress options to evaluate MySQL engines · 491f4c89
  Julien Muchembled authored Apr 23, 2019
```
--kill-mysqld should be combined with something like -f .3 -r .1
to give storage nodes enough time to recover.
And also -D 0 to focus testing on the storage backend rather than NEO.
```
  491f4c89
- qa: provide a way to let tests start 1 mysqld per storage node · c11410ef
  Julien Muchembled authored Apr 23, 2019
  
  c11410ef
- mysql: make 'user' actually optional in the DB connection string · 74ec44e3
  Julien Muchembled authored Apr 23, 2019
  
  74ec44e3
- mysql: specify column families for RocksDB · 87c1de3b
  Julien Muchembled authored Apr 17, 2019
  
  87c1de3b
16 Apr, 2019 5 commits
- qa: add testIncremental (testImporter) test · aa7b654f
  Julien Muchembled authored Apr 09, 2019
  
  aa7b654f
- importer: fix hidden "maximum recursion depth exceeded" at startup · d5834ee9
  Julien Muchembled authored Apr 09, 2019
  
  d5834ee9
- importer: fix closure of ZODB, and also do it when the import is finished · c37bcfa3
  Julien Muchembled authored Apr 09, 2019
  
  c37bcfa3
- sqlite: fix resumption of migration to NEO with Importer · 6608a868
  Julien Muchembled authored Apr 09, 2019
  
  6608a868
- qa: fix a random failure in threaded tests · 989e9920
  Julien Muchembled authored Apr 06, 2019
```
This also reverts commit 442bb43a.
```
  989e9920
05 Apr, 2019 3 commits
- importer: speed up startup when the import is already finished · 26b1246a
  Julien Muchembled authored Apr 05, 2019
  
  26b1246a
- importer: fix replication (as source) once import is finished · 9d14ea1b
  Julien Muchembled authored Apr 05, 2019
```
This fixes up commit be839e92.
```
  9d14ea1b
- storage: fix DatabaseManager.getLastTID with max_tid · c58d4862
  Julien Muchembled authored Apr 05, 2019
  
  c58d4862