- 24 Apr, 2018 1 commit
-
-
sjaakola authored
These test can sporadically show mutex deadlock warnings between LOCK_wsrep_thd and LOCK_thd_data mutexes. This means that these mutexes can be locked in opposite order by different threads, and thus result in deadlock situation. To fix such issue, the locking policy of these mutexes should be revised and enforced to be uniform. However, a quick code review shows that the number of lock/unlock operations for these mutexes combined is between 100-200, and all these mutex invocations should be checked/fixed. On the other hand, it turns out that LOCK_wsrep_thd is used for protecting access to wsrep variables of THD (wsrep_conflict_state, wsrep_query_state), whereas LOCK_thd_data protects query, db and mysys_var variables in THD. Extending LOCK_thd_data to protect also wsrep variables looks like a viable solution, as there should not be a use case where separate threads need simultaneous access to wsrep variables and THD data variables. In this commit LOCK_wsrep_thd mutex is refactored to be replaced by LOCK_thd_data. By bluntly replacing LOCK_wsrep_thd by LOCK_thd_data, will result in double locking of LOCK_thd_data, and some adjustements have been performed to fix such situations.
-
- 23 Apr, 2018 3 commits
-
-
Jan Lindström authored
MDEV-15948 Fix error "Lost connection to MySQL server..." in test gal…
-
Daniele Sciascia authored
* Increased timeout counter in galera_wait_ready.inc * Replaced useless include/wait_until_ready.inc after start_mysqld.inc in galera_st_*.inc with wait_condition on cluster size.
-
Jan Lindström authored
MDEV-15929 Fix lock wait timeout on `SELECT @@GLOBAL.WSREP_ON`
-
- 20 Apr, 2018 1 commit
-
-
Daniele Sciascia authored
Test galera_sst_mysqldump often fails with error "2013: Lost connection to MySQL server during query". The connection is lost after the test restart one of the nodes. This happens because the server closes client connections if it is joining a cluster through SST method mysqldump. On unlucky runs of the test it is possible that mysqld is restarted, and then mtr client is disconnected while it tries to determine if galera is ready before going on with the test. This patch rewrites galera_wait_ready.inc so that it is immune to being disconnected.
-
- 19 Apr, 2018 1 commit
-
-
Daniele Sciascia authored
This patch fixes a lock wait timeout error on `SELECT @@GLOBAL.WSREP_ON` in `wait_wsrep_ready.inc`: ``` --connection node_2 ... --source include/kill_galera.inc --connection node_1 --source include/wait_until_connected_again.inc # This includes wait_wsrep_ready.inc ``` The problem is that on node_2, kill_galera.inc may return before the node is killed. So node_1 may still see that node_1 is alive and will attempt to sync wait when doing those `SELECT` statements. But sync wait is doomed to fail given that node_1 is killed, hence the lock wait timeout. One possible fix is to disable wsrep_sync_wait before including wait_until_connected_again. However, it appears that including wait_until_connected_again is not necessary at all in node_1, so this patch removes it altogether.
-
- 16 Apr, 2018 1 commit
-
-
Jan Lindström authored
fil_crypt_rotate_pages If tablespace is marked as stopping stop also page rotation fil_crypt_flush_space If tablespace is marked as stopping do not try to read page 0 and write it back.
-
- 13 Apr, 2018 4 commits
-
-
Sergei Golubchik authored
-
Jan Lindström authored
Fixed typo on select that is executed only when something unexpected happens.
-
Jan Lindström authored
MDEV-12632: Source and destination overlap in memcpy, encryption.innodb-discard-import-change fails in buildbot with valgrind Use block->page.offset for checking page number.
-
Jan Lindström authored
Problem was that we did not delete explain information when Galera must replay a query. Could not find easily repeatable test case that would not cause other problems.
-
- 12 Apr, 2018 4 commits
-
-
Vladislav Vaintroub authored
CIFS does not like O_DIRECT flag (it is set successfully, but pread would fail). The fix is not to use O_DIRECT, there is not need for it. posix_fadvise() was used already that should prevent buffer cache pollution on Linux. As recommended by documentation of posix_fadvise(), we'll also fsync() tablespaces after a batch of writes.
-
Vladislav Vaintroub authored
System tablespace can be specified with absolute path, if innodb_data_home_dir is empty. This was not handled well by mariabackup 1. In backup phase, empty innodb_data_home_dir variable from server was not recognized. full paths were stored in backup-my.ini, even if it stored all files locally. thus prepare phase would not find the system tablespace files. 2. In copy-back phase, copy would not be done to the absolute destination path, as path would be stripped with trim_dotslash This patch fixes the above defects.
-
Jan Lindström authored
MDEV-12632: Source and destination overlap in memcpy, encryption.innodb-discard-import-change fails in buildbot with valgrind Problem was that if tablespace was encrypted we try to copy also page 0 from read buffer to write buffer that are in that case the same memory area. fil_iterate When tablespace is encrypted or compressed its first page (i.e. page 0) is not encrypted or compressed and there is no need to copy buffer.
-
Jan Lindström authored
Wait until rotation has ended and shutdown before grep to make sure that dirty pages are on datafiles.
-
- 11 Apr, 2018 5 commits
-
-
Vladislav Vaintroub authored
Return early if plugin is not built.
-
Jan Lindström authored
MDEV-15804 Fix and re-enable MTR test galera.pxc-421
-
Jan Lindström authored
MDEV-15808 Fix and re-enable test galera.galera_gra_log
-
Daniele Sciascia authored
This patch makes two changes: * Remove unnecessary `set SESSION wsrep_sync_wait=0`. Disabling `wsrep_sync_wait` caused the test to fail occasionally, due to subsequent `SELECT`s observing stale values. * Remove redundant `--source include/galera_wait_ready.inc`, `galera_wait_ready.inc` is already included in the above `wait_until_connected_again.inc`
-
Daniele Sciascia authored
Test would occasionally fail as follows: ``` mysqltest: At line 20: query 'SELECT COUNT(*) = 0 FROM t1' failed: 1317: Query execution was interrupted ``` This was due to a `CREATE TABLE` applied concurrently that would occasionally cause the `SELECT` to be BF aborted, due to MDL conflict. The `SELECT` is executed while `wsrep_on=OFF`. The change makes sure it is executed only after wsrep is enabled again.
-
- 10 Apr, 2018 4 commits
-
-
Vicențiu Ciorbaru authored
There was a missing argument to service_manager_extend_timeout call and the signness of arguments did not match.
-
Jan Lindström authored
Wait drop table to be replicated before continuing.
-
Marko Mäkelä authored
If innodb_fast_shutdown<2, all transactions of active connections will be rolled back on shutdown. This can take a long time, and the systemd shutdown timeout should be extended during the wait. logs_empty_and_mark_files_at_shutdown(): Extend the timeout when waiting for other threads to complete.
-
Marko Mäkelä authored
This reverts commit 76ec37f5. This behaviour change will be done separately in: MDEV-15832 With innodb_fast_shutdown=3, skip the rollback of connected transactions
-
- 09 Apr, 2018 6 commits
-
-
Vladislav Vaintroub authored
-
Vladislav Vaintroub authored
-
Vicențiu Ciorbaru authored
Table_open_cache gets adjusted on server startup to prevent an out of file descriptor error. However this means that when we reset its value to default, it does not get re-adjusted. This leads to the mtr consistency check to fail with different server status at the end of the test case as opposed to when it started. To fix the problem, do not make use of the DEFAULT keyword to set the variable back, instead save the value before any modifications and restore it from there.
-
Jan Lindström authored
Disable occasionally failing test case galera.galera_var_slave_threads.
-
Jan Lindström authored
Wait until create table is replicated before continuing.
-
Jan Lindström authored
Wait until create tables are replicated to the slave before trying lock table.
-
- 08 Apr, 2018 3 commits
-
-
Jan Lindström authored
Disable more occasionally failing test cases. galera.galera_lock_table galera.galera_gra_log galera.MW-44 galera.lp1376747 galera.lp1376747-2 galera.galera_pc_ignore_sb
-
Jan Lindström authored
Make sure DDL's (create tables) are replicated before continuing to lock table.
-
Jan Lindström authored
Disable occasionally failing test cases pxc-421 query_cache galera_parallel_simple
-
- 07 Apr, 2018 4 commits
-
-
Marko Mäkelä authored
fil_crypt_read_crypt_data(): Do not attempt to read the tablespace if the file is unaccessible due to a pending DDL operation, such as renaming the file or DROP TABLE or TRUNCATE TABLE. This is only reducing the probability of the race condition, not completely preventing it.
-
Vicențiu Ciorbaru authored
-
Jan Lindström authored
-
Jan Lindström authored
MDEV-13549 Fix test galera.galera_wsrep_desync_wsrep_on
-
- 06 Apr, 2018 3 commits
-
-
Jan Lindström authored
-
Daniele Sciascia authored
The test tends to fail if many parallel instances of it are executed: ``` mysqltest: At line 23: query 'ALTER TABLE t1 ADD PRIMARY KEY (f1)' failed: 1317: Query execution was interrupted ``` The `ALTER` fails because it is BF aborted due to an earlier `INSERT SELECT` that is being applied: ``` INSERT INTO t1 (f1) SELECT ... --connection node_2 SET GLOBAL wsrep_desync = TRUE; SET SESSION wsrep_on = FALSE; ALTER TABLE t1 ADD PRIMARY KEY (f1); SET SESSION wsrep_on = TRUE; SET GLOBAL wsrep_desync = FALSE; ``` And because the `ALTER` is executed with `wsrep_on = OFF`, it does not run in total order isolation. To avoid the problem it must be ensured that the `ALTER` only after the large `INSERT SELECT` is done. To do so it is sufficient to issue `SELECT COUNT(*) FROM t1;` from `node_2` before turning off wsrep. The `SELECT` will trigger `wsrep_sync_wait` and proceed only after the `INSERT SELECT` from node_1 is done.
-
Marko Mäkelä authored
-