Commits · e7c2051ff9a8b00ff846a66dbe8658fcd0cfa33d · Vincent Pelletier / neoppod

24 May, 2018 8 commits
- Document that the bug when checking replicas may also cause the master to crash · e7c2051f
  Julien Muchembled authored May 24, 2018
  
- storage: stop logging 'Abort TXN' for txn that have been locked · f7cf8f07
  Julien Muchembled authored May 24, 2018
```
It was confusing and there's already the 'Unlock TXN' log just before abort()
is called (in this case, it's more a cleanup than an abort).
```
- storage: split _migrate2() for reusable _alterTable() · d9b98671
  Julien Muchembled authored May 17, 2018
```
Future migration steps are likely to alter tables, possibly with
transformation of data, and this is complicated for both supported backends.
```
- qa: new testStorageUpgrade · e2dacd6a
  Julien Muchembled authored May 24, 2018
  
- qa: update testStorageUpgrade data for what is not automatically upgraded · 477e0e44
  Julien Muchembled authored May 22, 2018
```
Some changes in the storage format are minor and applying them automatically
would cost too much for big databases.

Here, we apply them manually so that testStorageUpgrade will be able to
compare dumps.

We hope however that with improvements like
  https://jira.mariadb.org/browse/MDEV-12836
we'll be able to implement more migration steps
and revert parts of this commit.
```
- qa: original data for the future testStorageUpgrade · 933579f5
  Julien Muchembled authored May 22, 2018
```
These dumps were generated with an old version of NEO, plus a backport of the
test that will use them.

In MySQL dumps, --hex-blob was used only for inserts in the 'data' table.
```
- sqlite: fix indexes of upgraded db · 791900c7
  Julien Muchembled authored May 22, 2018
  
- importer: fix NameError when recovering during tpc_finish · 6dcda4e6
  Julien Muchembled authored May 21, 2018
  
17 May, 2018 1 commit

fixup! importer: fetch and process the data to import in a separate process · dc220d04

Julien Muchembled authored May 17, 2018

- for FileStorage DB, make sure a transaction index is built at most once
- for other DB types, reopen the DB in the subprocess

Now that we have specific code for FileStorage, the generic case is not tested
anymore. We should add a test using ZEO. Or better, and in some way crazy,
one with NEO, but one would need to fix a special case in getObject.


16 May, 2018 5 commits

Serialize empty transaction extension with an empty string · a6d4c4e9

Julien Muchembled authored May 15, 2018

The protocol version is increased to ensure that client nodes are able to
handle an empty 'extension' field in AnswerTransactionInformation.

It also means that once new transactions are written, going back to a previous
revision is not possible.


client: fix partial import from a source storage · 346c9d00

Julien Muchembled authored May 15, 2018

The correct way to specify a start/stop tid is when constructing the 'source'
object, hence the removal of the start/stop args. In fact, source.iterator()
does not always take such args.

On the other hand, when resuming an import, Application.importFrom must cope
with an incomplete preindex.


qa: give a title to subprocesses of functional tests · b648904b
Julien Muchembled authored May 07, 2018
```
Same as the previous commit: purely cosmetic, so optional.
```

importer: give a title to the 'import' and 'writeback' subprocesses · 461df152

Julien Muchembled authored May 07, 2018

'title' means both process name and command line.

This is purely cosmetic, so it won't fail if the 'setproctitle' module
is not available.
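
The optional dependency described above can be sketched as follows. This is only an illustration, not the code of the commit: the helper name `set_title` and the title format are assumptions, and only the `setproctitle.setproctitle()` call comes from the real module.

```python
import sys

try:
    from setproctitle import setproctitle
except ImportError:
    # Purely cosmetic feature: fall back to a no-op when the module is missing.
    def setproctitle(title):
        pass


def set_title(role):
    """Best-effort labelling of the current process (hypothetical helper).

    The title format used here is an assumption made for this sketch.
    """
    title = "%s: %s" % (sys.argv[0], role)
    setproctitle(title)
    return title
```

A subprocess would call something like `set_title("import")` right after starting, and the call succeeds whether or not the module is installed.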


importer: fetch and process the data to import in a separate process · 05bf48de

Julien Muchembled authored May 02, 2018

A new subprocess is used to:
- fetch data from the source DB
- repickle to change oids (when merging several DB)
- compress
- checksum

This is mostly useful for the second step, which is much slower than
any other step and does not release the GIL.

By using a second CPU core, it is also often possible to use a better
compression algorithm for free (e.g. zlib=9). Actually, smaller data can speed
up the writing process.

In addition to greatly speeding up the import by parallelizing fetch+process with
write, it also makes the main process more responsive to queries from client
nodes.
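
A minimal sketch of this producer/consumer split, using only the standard library. It is not the importer's actual code: the function names, the queue size and the choice of SHA-1 over the compressed bytes are assumptions made for illustration, and the repickling step is left out.

```python
import hashlib
import multiprocessing
import zlib


def process_record(data, level=9):
    """Compress and checksum one record (repickling is omitted here)."""
    compressed = zlib.compress(data, level)
    # Assumption of this sketch: the checksum covers the bytes as stored.
    return hashlib.sha1(compressed).digest(), compressed


def worker(records, queue):
    """Runs in the child process: fetch and process, then hand over results."""
    for data in records:
        queue.put(process_record(data))
    queue.put(None)  # sentinel: nothing more to import


def import_all(records):
    """Parent process: consume processed records while the child works."""
    # A bounded queue keeps the child from running far ahead of the writer.
    queue = multiprocessing.Queue(maxsize=16)
    child = multiprocessing.Process(target=worker, args=(records, queue))
    child.start()
    written = []
    while True:
        item = queue.get()
        if item is None:
            break
        written.append(item)  # stand-in for writing to the backend
    child.join()
    return written
```

Calling `import_all([...])` under an `if __name__ == "__main__":` guard returns one `(checksum, compressed)` pair per record. Because the CPU-bound work happens in another process, the parent stays free to write and to answer other requests.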


15 May, 2018 1 commit

importer: new option to write back new transactions to the source database · 30a02bdc

Julien Muchembled authored Apr 19, 2018

By doing the work with secondary connections to the underlying databases,
asynchronously and in a separate process, this should have minimal impact on
the performance of the storage node. Extra complexity comes from backends that
may lose connection to the database (here MySQL): this commit fully implements
reconnection.
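
The general shape of reconnection logic can be sketched as below. This is a generic illustration rather than the approach taken in the commit: `ConnectionLost`, `with_reconnect` and the retry limit are all hypothetical, and a real backend would have to map its driver's own disconnection errors onto such an exception.

```python
class ConnectionLost(Exception):
    """Hypothetical stand-in for a backend 'server has gone away' error."""


def with_reconnect(connect, operation, max_attempts=3):
    """Run operation(conn), reconnecting when the connection is lost.

    Only safe when the operation can be retried from scratch, e.g. because
    the lost connection implicitly rolled back any uncommitted work.
    """
    conn = connect()
    for attempt in range(max_attempts):
        try:
            return operation(conn)
        except ConnectionLost:
            if attempt == max_attempts - 1:
                raise
            conn = connect()  # open a fresh connection and try again
```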


11 May, 2018 3 commits
- importer: log when the transaction index for FileStorage DB is built · 2fae3e54
  Julien Muchembled authored Apr 19, 2018
  
- importer: open imported zodb in read-only whenever possible · db20bf37
  Julien Muchembled authored Apr 16, 2018
```
For FileStorage DB, this avoids:
- keeping a lock on the source DB during the whole import,
- saving the whole index when the import was resumed.
```
- fixup! mysql: fix remaining places where a server disconnection was not caught · 26f898c1
  Julien Muchembled authored May 09, 2018
  
07 May, 2018 4 commits
- fixup! storage: speed up replication by sending bigger network packets · 1a064725
  Julien Muchembled authored May 07, 2018
  
- mysql: do not full-scan for duplicates of big oids if deduplication is disabled · 156da51c
  Julien Muchembled authored Apr 19, 2018
  
- mysql: fix remaining places where a server disconnection was not caught · a63b45fe
  Julien Muchembled authored Apr 19, 2018
  
- fixup! Add support for custom compression levels · fec86e26
  Julien Muchembled authored May 04, 2018
  
18 Apr, 2018 3 commits
- importer: reenable compression by default · 0c34630c
  Julien Muchembled authored Apr 18, 2018
```
It was disabled by mistake in commit fd80cc30.
```
- qa: review testImporter · 838f450c
  Julien Muchembled authored Apr 18, 2018
```
- Stop using NEO source code as sample data.
- For ZODB5, add a test that does not merge several DB.
```
- qa: remove a few uses of 'chr' · f4c2fc6a
  Julien Muchembled authored Apr 17, 2018
  
16 Apr, 2018 3 commits

Fix a few issues with ZODB5 · 1316c225

Julien Muchembled authored Apr 16, 2018

In the Importer storage backend, the repickler code never really worked with
ZODB 5 (use of protocol > 1), and now the test does not pass anymore.

The other issues caused by ZODB commit 12ee41c47310156027a674932df34b60de86ba36
are fixed:

  TypeError: list indices must be integers, not binary

  ValueError: unsupported pickle protocol: 3

Although not necessary as long as we don't support Python 3,
this commit also replaces `str` by `bytes` in a few places.


importer: small code cleanup in speedupFileStorageTxnLookup patch · b6989a0e
Julien Muchembled authored Apr 16, 2018


importer: do not trigger speedupFileStorageTxnLookup uselessly · 3bcac6d3

Julien Muchembled authored Apr 16, 2018

When importing a FileStorage DB without interruption and without having to
serve client nodes, the index built by speedupFileStorageTxnLookup is useless.
This happens when doing simulation tests and, on a DB with many oids,
building it can take a lot of time and memory for nothing.


13 Apr, 2018 2 commits
- Add support for custom compression levels · fd80cc30
  Julien Muchembled authored Apr 10, 2018
  
- setup: update MANIFEST.in · 6f855eef
  Julien Muchembled authored Apr 13, 2018
```
This was forgotten in commit 5de0ff3a.
```
12 Apr, 2018 2 commits
- importer: do not checksum data twice · 3eee728a
  Julien Muchembled authored Apr 12, 2018
  
- client: store uncompressed if compressed size is equal · b954857f
  Julien Muchembled authored Apr 12, 2018
```
The Importer storage backend already does this.
```
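
The rule in this commit title can be sketched as follows. The helper name `maybe_compress` and the returned `(flag, payload)` pair are assumptions made for illustration, not the client's actual interface.

```python
import zlib


def maybe_compress(data, level=-1):
    """Return (compression_flag, payload) for one record.

    The compressed form is kept only when it is strictly smaller, so data
    whose compressed size is equal (or larger) is stored uncompressed.
    """
    compressed = zlib.compress(data, level)
    if len(compressed) < len(data):
        return 1, compressed
    return 0, data
```

Storing the raw bytes on a tie saves the reader a useless decompression step at no cost in space.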
10 Apr, 2018 1 commit
- fixup! master: automatically discard feeding cells that get out-of-date · 42ca12eb
  Julien Muchembled authored Apr 10, 2018
```
This fixes a random failure in testSafeTweak:

  failureException: 'UU.|U.U|.UU' != 'UU.|.UU|U.U'
```
29 Mar, 2018 2 commits

master: automatically discard feeding cells that get out-of-date · 3efbbfe3

Julien Muchembled authored Mar 29, 2018

This is a follow-up of commit 2ca7c335,
which changed 'tweak' not to discard readable cells too quickly.

The scenario of a storage being lost while it has feeding cells was forgotten.
These must be discarded immediately, otherwise we end up with more up-to-date
cells than wanted. Without the change in outdate(), testSafeTweak would end
with: UU.|U.U|UUU

Once replication is optimized not to always restart checking cells from the
beginning:
- Remembering that an out-of-date cell was feeding could be a safer
  option, but it may not be worth the extra complexity.
- Another possibility may be to replace the FEEDING state by an automatic
  partial tweak that only discards excess up-to-date cells whenever a cell
  becomes up-to-date.


qa: remove useless indentation in testSafeTweak · 3443d483
Julien Muchembled authored Mar 29, 2018


20 Mar, 2018 2 commits
- bench: new option to measure ZEO perfs in matrix test · b621a98f
  Julien Muchembled authored Mar 20, 2018
  
- bench: reduce number of partitions in matrix test · 114c7ab6
  Julien Muchembled authored Mar 20, 2018
  
14 Mar, 2018 1 commit

storage: fix replication of creation undone · c3343279

Julien Muchembled authored Mar 14, 2018

For records that undo object creation, None values are used at the backend
level whereas the protocol is not designed to serialize None for any field.

Therefore, a dance is done in many places around packet serialization, using the
specific 0/ZERO_HASH/'' triplet to represent a deleted oid. For replication,
it was missing on the sender side, leading to the following crash:

  Traceback (most recent call last):
    File "neo/storage/app.py", line 147, in run
      self._run()
    File "neo/storage/app.py", line 178, in _run
      self.doOperation()
    File "neo/storage/app.py", line 257, in doOperation
      next(task_queue[-1]) or task_queue.rotate()
    File "neo/storage/handlers/storage.py", line 271, in push
      conn.send(Packets.AddObject(oid, *object), msg_id)
    File "neo/lib/protocol.py", line 234, in __init__
      self._fmt.encode(buf.write, args)
    File "neo/lib/protocol.py", line 345, in encode
      return self._trace(self._encode, writer, items)
    File "neo/lib/protocol.py", line 334, in _trace
      return method(*args)
    File "neo/lib/protocol.py", line 367, in _encode
      item.encode(writer, value)
    File "neo/lib/protocol.py", line 345, in encode
      return self._trace(self._encode, writer, items)
    File "neo/lib/protocol.py", line 342, in _trace
      raise ParseError(self, trace)
  ParseError: at add_object/checksum:
    File "neo/lib/protocol.py", line 553, in _encode
      assert len(checksum) == 20, (len(checksum), checksum)
  TypeError: object of type 'NoneType' has no len()
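
The 0/ZERO_HASH/'' convention can be sketched as a pair of conversion helpers. The function names are hypothetical and the 20-byte all-zero value for `ZERO_HASH` is an assumption here, chosen to match the `len(checksum) == 20` assertion in the traceback.

```python
ZERO_HASH = b"\0" * 20  # assumed: same length as a real 20-byte checksum


def to_wire(compression, checksum, data):
    """Replace the backend's None values by the serializable triplet."""
    if data is None:
        return 0, ZERO_HASH, b""
    assert len(checksum) == 20, (len(checksum), checksum)
    return compression, checksum, data


def from_wire(compression, checksum, data):
    """Map the special triplet back to None values on the receiving side."""
    if (compression, checksum, data) == (0, ZERO_HASH, b""):
        return None, None, None
    return compression, checksum, data
```

With such a conversion applied before building the packet, a record that undoes an object creation no longer reaches the encoder with a `None` checksum.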


13 Mar, 2018 1 commit
- Release version 1.9 · 1b57a7ae
  Julien Muchembled authored Mar 13, 2018
  
02 Mar, 2018 1 commit

master: fix resumption of backup replication (internal or not) · 27229793

Julien Muchembled authored Feb 27, 2018

Before, it waited for upstream activity until all partitions were touched.
However, when upstream was idle, the backup cluster could remain stuck forever
if it had been interrupted while some cells were still late.
