Commits · 0b93b1fb4f8418fc898a6660933daad1b01a1246 · Kirill Smelkov / neo

28 Aug, 2015 1 commit

Fix occasional deadlocks in threaded tests · 0b93b1fb

Julien Muchembled authored Aug 28, 2015

Deadlocks mainly happened while stopping a cluster, hence the complete review
of NEOCluster.stop().

A major change is to make the client node handle its lock like other nodes
(i.e. in the polling thread itself) to better know when to call
Serialized.background() (there was a race condition with the test of
'self.poll_thread.isAlive()' in ClientApplication.close).

0b93b1fb

14 Aug, 2015 2 commits

Remove useless assert in a private method of MTClientConnection · 1ab594b4
Julien Muchembled authored Aug 12, 2015

1ab594b4

Do not reconnect too quickly to a node after an error · d898a83d

Julien Muchembled authored Aug 09, 2015

For example, a backup storage node that was rejected because the upstream
cluster was not ready could reconnect in a loop without delay, using 100% CPU
and flooding logs.

A new 'setReconnectionNoDelay' method on Connection can be used for cases where
it's legitimate to quickly reconnect.

With this new delayed reconnection, it's possible to remove the remaining
time.sleep().

d898a83d
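
The idea of delaying reconnection after an error can be sketched as follows. This is a minimal illustration written for Python 3, not NEO's code: the `ReconnectPolicy` class, its `delay` parameter and all method names are hypothetical, and `set_no_delay` only mirrors the intent of the 'setReconnectionNoDelay' method named in the commit message.

```python
import time


class ReconnectPolicy:
    """Hypothetical helper enforcing a minimum delay between attempts."""

    def __init__(self, delay=1.0, clock=time.monotonic):
        self.delay = delay          # assumed value, in seconds
        self.clock = clock          # injectable so the logic can be tested
        self._next_attempt = 0.0    # earliest time a reconnection is allowed

    def connection_failed(self):
        # After an error, forbid reconnecting until `delay` seconds elapsed.
        self._next_attempt = self.clock() + self.delay

    def set_no_delay(self):
        # For cases where reconnecting immediately is legitimate.
        self._next_attempt = 0.0

    def time_until_retry(self):
        # 0.0 means "reconnect now"; otherwise the remaining wait in seconds.
        return max(0.0, self._next_attempt - self.clock())
```

Under these assumptions, the value returned by `time_until_retry()` could be used as a poll timeout, so the caller never needs to block in `time.sleep()`.
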

12 Aug, 2015 16 commits
- Remove useless testEvent · 71e30fb9
  Julien Muchembled authored Aug 12, 2015
```
This kind of test has never helped to detect regressions and any bug in
EpollEventManager would be quickly reported by other tests.

testConnection may go the same way if it keeps annoying me too much.
```
  71e30fb9
- client: do not wait for the remote to close the connection if it's not ready · f9df31be
  Julien Muchembled authored Aug 10, 2015
```
This is currently not an issue because the 'time.sleep(1)' calls in iterateForObject
(storage) and _connectToPrimaryNode (master) leave enough time. What could
happen is a new connection attempt for a node that already has a connection
(causing an assertion failure in Node.setConnection).
```
  f9df31be
- Fix invalid processing of unregistered connections · a4731a0c
  Julien Muchembled authored Aug 09, 2015
```
This could happen if a file descriptor was reallocated by the kernel.
```
  a4731a0c
- Simplify API to establish connections and accept mix of IPv4/IPv6 · ed50edca
  Julien Muchembled authored Aug 08, 2015
  
  ed50edca
- Rename parameter of polling methods now that _poll computes the timeout itself · c2c97752
  Julien Muchembled authored Aug 12, 2015
  
  c2c97752
- Tickless poll loop, for lowest latency and CPU usage · eef52c27
  Julien Muchembled authored Aug 02, 2015
```
With this patch, the epoll object is no longer woken up every second to check
whether a timeout has expired. The API of Connection is changed so that the
smallest timeout can be retrieved.
```
  eef52c27
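
The principle behind a tickless loop can be sketched as follows: instead of waking up at a fixed interval, the loop blocks for exactly as long as the nearest deadline allows. This is a minimal Python 3 illustration under stated assumptions, not the code of the commit: `next_poll_timeout` and the representation of deadlines as absolute monotonic timestamps are hypothetical.

```python
import time


def next_poll_timeout(deadlines, now=None):
    """Return how long the poll call may block, in seconds.

    `deadlines` holds absolute timestamps (or None for "no timeout").
    The result is None when nothing is pending, meaning the poll call
    may block until an I/O event occurs.
    """
    if now is None:
        now = time.monotonic()
    pending = [d for d in deadlines if d is not None]
    if not pending:
        return None
    # Never return a negative value: an expired deadline means "do not block".
    return max(0.0, min(pending) - now)
```

The returned value can be passed as the timeout of a selector, for example `selectors.BaseSelector.select(timeout)`, which blocks until an event occurs when the timeout is `None`.
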
- tests: make Patch usable as a context manager · fd0b9c98
  Julien Muchembled authored Aug 05, 2015
  
  fd0b9c98
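
What "usable as a context manager" means for a patching helper can be sketched as follows. This is a hypothetical Python 3 class that only shares its name with the test helper mentioned in the commit title; its constructor signature and its `apply`/`revert` methods are assumptions made for illustration.

```python
class Patch:
    """Hypothetical helper: temporarily replace one attribute of an object."""

    def __init__(self, target, name, replacement):
        self.target = target
        self.name = name
        self.replacement = replacement
        self._applied = False

    def apply(self):
        # Remember the original attribute so that it can be restored later.
        self._original = getattr(self.target, self.name)
        setattr(self.target, self.name, self.replacement)
        self._applied = True

    def revert(self):
        if self._applied:
            setattr(self.target, self.name, self._original)
            self._applied = False

    def __enter__(self):
        self.apply()
        return self

    def __exit__(self, *exc_info):
        # Always restore the original, even if the body raised.
        self.revert()
```

With such a class, a test can write `with Patch(SomeClass, "method", fake):` and rely on the original attribute being restored when the block exits, whether or not an exception was raised.
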
- Add file descriptor and aborted flag to __repr__ of connections · 91c66356
  Julien Muchembled authored Aug 02, 2015
  
  91c66356
- client: replace Event by a pipe as a way to stop the poll loop · cb8a5a88
  Julien Muchembled authored Jul 25, 2015
```
This is a prerequisite for tickless poll loops.
```
  cb8a5a88
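
Stopping a poll loop through a pipe is a common technique: the read end of the pipe is registered with the poller, and writing a byte to the other end wakes the loop up even when it blocks without a timeout. The sketch below is a Python 3 illustration for POSIX systems, not the code of the commit; the `PollLoop` class and its method names are hypothetical.

```python
import os
import selectors
import threading


class PollLoop:
    """Hypothetical poll loop that another thread can stop via a pipe."""

    def __init__(self):
        self.selector = selectors.DefaultSelector()
        self._r, self._w = os.pipe()
        # The read end is polled like any other file descriptor.
        self.selector.register(self._r, selectors.EVENT_READ)

    def run(self):
        while True:
            # No timeout: the loop sleeps until some descriptor is readable.
            for key, _ in self.selector.select(timeout=None):
                if key.fd == self._r:
                    os.read(self._r, 1)   # consume the wake-up byte
                    return
                # Other descriptors would be dispatched to handlers here.

    def stop(self):
        # Safe to call from any thread: wakes up the blocked select().
        os.write(self._w, b"\0")

    def close(self):
        self.selector.close()
        os.close(self._r)
        os.close(self._w)


loop = PollLoop()
thread = threading.Thread(target=loop.run)
thread.start()
loop.stop()
thread.join(5)
stopped = not thread.is_alive()
loop.close()
```

Unlike an event object checked between polls, the pipe lets the loop block indefinitely, which is why this kind of change fits a loop that no longer ticks every second.
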
- Fix 100% CPU usage when the closure of a connection is delayed · 4a328ade
  Julien Muchembled authored Aug 01, 2015
  
  4a328ade
- client: review connection locking (MTClientConnection) · 4e739de4
  Julien Muchembled authored Jul 27, 2015
```
This mainly changes several methods to lock automatically instead of asserting
that the caller did it. This removes any overhead for non-MT classes, and
the use of 'with' instead of lock/unlock methods also simplifies the API.
```
  4e739de4
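
The pattern described above, where methods take the lock themselves instead of asserting that the caller holds it, can be sketched as follows. The class names and the `send` method are hypothetical and written for Python 3; they only illustrate the split between a lock-free base class and a multi-threaded subclass.

```python
import threading


class Connection:
    """Hypothetical non-MT class: no lock, hence no locking overhead."""

    def __init__(self):
        self.queue = []

    def send(self, packet):
        self.queue.append(packet)
        return len(self.queue)


class MTConnection(Connection):
    """Hypothetical MT variant: each public method locks automatically."""

    def __init__(self):
        super().__init__()
        # Re-entrant, so a locked method may call another locked method.
        self._lock = threading.RLock()

    def send(self, packet):
        with self._lock:
            return super().send(packet)
```

Callers of `MTConnection.send()` no longer need matching lock/unlock calls, and the `with` statement guarantees the lock is released even if the method raises.
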
- client: a simple lock is enough for the connection pool · e438f864
  Julien Muchembled authored Aug 10, 2015
  
  e438f864
- Remove useless socket shutdown on close · c319b065
  Julien Muchembled authored Jul 24, 2015
```
shutdown is implicit because we don't duplicate sockets.
```
  c319b065
- Small optimizations & cleanups · 19745e7c
  Julien Muchembled authored Jul 24, 2015
  
  19745e7c
- Better output of verbose locks · 5b69d553
  Julien Muchembled authored Jul 28, 2015
```
- For all threads except the main one, the id is displayed instead of the name,
  because the latter is not always unique.
- Output from concurrent threads may be interleaved, so tracebacks are also
  prefixed with their idents.
```
  5b69d553
- Fix verbose locks when acquiring without blocking · ede173f8
  Julien Muchembled authored Jul 28, 2015
  
  ede173f8
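
The two verbose-lock commits above (5b69d553 and ede173f8) can be illustrated with a small wrapper that logs every operation, prefixes each line with the thread ident, and reports the result of a non-blocking acquire. This is a hypothetical Python 3 sketch, not NEO's implementation; the `VerboseLock` name and its log format are assumptions.

```python
import sys
import threading


class VerboseLock:
    """Hypothetical lock wrapper that logs each acquire and release."""

    def __init__(self, out=sys.stderr):
        self._lock = threading.Lock()
        self._out = out

    def _note(self, message):
        thread = threading.current_thread()
        # Thread names are not always unique, so use the ident for every
        # thread except the main one.
        if thread is threading.main_thread():
            label = thread.name
        else:
            label = str(thread.ident)
        self._out.write("[%s] %s\n" % (label, message))

    def acquire(self, blocking=True):
        self._note("acquiring (blocking=%s)" % blocking)
        acquired = self._lock.acquire(blocking)
        # A non-blocking acquire may fail: log the actual outcome.
        self._note("acquired" if acquired else "not acquired")
        return acquired

    def release(self):
        self._lock.release()
        self._note("released")

    def __enter__(self):
        return self.acquire()

    def __exit__(self, *exc_info):
        self.release()
```

Prefixing every line with the same label makes it possible to separate the output of one thread from another after the fact, for example with `grep`.
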
28 Jul, 2015 1 commit
- Add a neo/debug.py example to display tracebacks of threads · 52ed5aab
  Julien Muchembled authored Jul 28, 2015
  
  52ed5aab
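
Displaying the tracebacks of all threads can be done with the standard library alone. The sketch below is written for Python 3 and is not the content of neo/debug.py; the function name `format_thread_tracebacks` is hypothetical.

```python
import sys
import threading
import traceback


def format_thread_tracebacks():
    """Return the current stack of every thread as a single string."""
    names = {t.ident: t.name for t in threading.enumerate()}
    chunks = []
    # sys._current_frames() maps each thread ident to its topmost frame.
    for ident, frame in sys._current_frames().items():
        stack = "".join(traceback.format_stack(frame))
        chunks.append("Thread %s (%s):\n%s" % (ident, names.get(ident, "?"), stack))
    return "\n".join(chunks)
```

Calling `print(format_thread_tracebacks())` from a debugging hook shows where each thread is currently blocked, which helps when diagnosing deadlocks.
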
13 Jul, 2015 2 commits
- Release version 1.4 · f4e656f6
  Julien Muchembled authored Jul 13, 2015
  
  f4e656f6
- Better handling of NotReady error · 167ad36b
  Julien Muchembled authored Jul 10, 2015
  
  167ad36b
10 Jul, 2015 1 commit
- Some documentation cleanup · 8ec87379
  Julien Muchembled authored Jul 10, 2015
  
  8ec87379
09 Jul, 2015 1 commit
- client: fix misleading exception message in case of checksum mismatch · 197054be
  Julien Muchembled authored Jul 09, 2015
  
  197054be
03 Jul, 2015 3 commits
- Fix neo/debug.py example for clients · 9e026d08
  Julien Muchembled authored Jul 03, 2015
  
  9e026d08
- client: prevent RTMIN+3 from connecting to master if not connected yet · e03a836a
  Julien Muchembled authored Jul 03, 2015
  
  e03a836a
- client: fix "signal only works in main thread" when adding a ZODB Mount Point to NEO · c324955d
  Julien Muchembled authored Jul 03, 2015
  
  c324955d
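
For context on the last commit above: Python raises `ValueError: signal only works in main thread` when `signal.signal()` is called from any other thread. The log does not say how the commit avoids this, so the sketch below only shows one possible guard, written for Python 3; `install_handler` is a hypothetical name.

```python
import signal
import threading


def install_handler(signum, handler):
    """Install a signal handler only when running in the main thread.

    Returns True if the handler was installed and False if the call was
    skipped because the current thread is not the main one.
    """
    if threading.current_thread() is not threading.main_thread():
        return False
    signal.signal(signum, handler)
    return True
```

With such a guard, code that runs in a worker thread, as may happen when a storage is opened from a request handler, skips the registration instead of failing.
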
01 Jul, 2015 1 commit
- Update changelog · 79fca358
  Julien Muchembled authored Jul 01, 2015
  
  79fca358
30 Jun, 2015 2 commits
- Add upgrade notes about MySQL/SQLite schema changes since NEO 1.3 · 02a5b4e3
  Julien Muchembled authored Jun 30, 2015
  
  02a5b4e3
- master: new option to automatically start a new cluster · 58774fb6
  Julien Muchembled authored Jun 29, 2015
  
  58774fb6
29 Jun, 2015 2 commits
- master: simplify recovery loop · 5a76664a
  Julien Muchembled authored Jun 29, 2015
  
  5a76664a
- Add support for IPython >= 1, ignore older versions · b19bf40e
  Julien Muchembled authored Jun 29, 2015
  
  b19bf40e
24 Jun, 2015 8 commits
- client: do not loop forever on unreadable cells when not connected to the master · 8173441b
  Julien Muchembled authored Jun 24, 2015
```
When the connection to the primary master node is lost, the node manager
no longer has a reliable list of running nodes, so iterateForObject()
must not retry any cell.
```
  8173441b
- client: code cleanup · 9c0a0c9e
  Julien Muchembled authored Jun 24, 2015
  
  9c0a0c9e
- storage: fix crash when a client tries to "steal" the UUID of another client · cf413589
  Julien Muchembled authored Jun 24, 2015
  
  cf413589
- Add more information in __repr__ of connections · 11debaa9
  Julien Muchembled authored Jun 24, 2015
  
  11debaa9
- When aborting, flush log containing pre-mortem data · 6b410098
  Julien Muchembled authored Jun 24, 2015
  
  6b410098
- mysql: log failed query in case of database failure · 93aef5ed
  Julien Muchembled authored Jun 23, 2015
  
  93aef5ed
- Fix read of empty transactions · f8ce322b
  Julien Muchembled authored Jun 23, 2015
```
Since transactions have metadata like a description, it may be useful
to allow them. But the behaviour of FileStorage is to silently drop them,
so we may have to do the same in the future.

An application that is not supposed to commit empty transactions should write
its own unit test to prevent this.
```
  f8ce322b
- Prevent nodes from reconnecting too fast · c05e65ac
  Julien Muchembled authored Jun 23, 2015
```
This happened between storage nodes of different clusters because they're not
informed about each other's state, e.g. a dead upstream storage node.

In any case, logs were flooded at 100% CPU usage.
```
  c05e65ac