Commits · 66f4900b517681da2aed3b562158ef58679961e4 · nexedi / MariaDB

11 Jan, 2021 2 commits

Revert "MDEV-23536 : Race condition between KILL and transaction commit" · 66f4900b

Sergei Golubchik authored Jan 11, 2021

This reverts the server part of the commit 775fccea
but keeps InnoDB part (which reverted MDEV-17092 5530a93f).

So after this both MDEV-23536 and MDEV-17092 are reverted,
and the original bug is resurrected.

66f4900b

MDEV-21478 Inplace ALTER fails to report error when FTS_DOC_ID · fdc4b7a6

Thirunarayanan Balathandayuthapani authored Dec 15, 2020

		with wrong data type is added

  Inplace alter fails to report error when fts_doc_id column with
wrong data type is added.

prepare_inplace_alter_table_dict(): Should check whether the column
is fts_doc_id. It should be of bigint type, should accept non null
data type and it should be in capital letters.

fdc4b7a6

09 Jan, 2021 1 commit
- MDEV-24554 Do not use verisign server for authenticode timestamping · 3b548d3b
  Vladislav Vaintroub authored Jan 09, 2021
  
  3b548d3b
08 Jan, 2021 5 commits

MDEV-23536 : Race condition between KILL and transaction commit · 775fccea

Jan Lindström authored Dec 17, 2020

A race condition may occur between the execution of transaction commit,
and an execution of a KILL statement that would attempt to abort that
transaction.

MDEV-17092 worked around this race condition by modifying InnoDB code.
After that issue was closed, Sergey Vojtovich pointed out that this
race condition would better be fixed above the storage engine layer:

If you look carefully into the above, you can conclude that
thd->free_connection() can be called concurrently with
KILL/thd->awake(). Which is the bug. And it is partially fixed in
THD::~THD(), that is destructor waits for KILL completion:

Fix: Add necessary mutex operations to THD::free_connection()
and move WSREP specific code also there. This ensures that no
one is using THD while we do free_connection(). These mutexes
will also ensures that there can't be concurrent KILL/THD::awake().

innobase_kill_query
  We can now remove usage of trx_sys_mutex introduced on MDEV-17092.

trx_t::free()
  Poison trx->state and trx->mysql_thd

This patch is validated with an RQG run similar to the one that
reproduced MDEV-17092.

775fccea

Cleanup: Remove unused symbol QUE_THR_PROCEDURE_WAIT · 18254c18
Marko Mäkelä authored Jan 08, 2021

18254c18
fixup MDEV-17556: fix mroonga · 61a362c9
Nikita Malyavin authored Jan 08, 2021

61a362c9
MDEV-19838 fixup: clang -Wunused-const-variable · cd1e5d65
Marko Mäkelä authored Jan 08, 2021

cd1e5d65

MDEV-17556 Assertion `bitmap_is_set_all(&table->s->all_set)' failed · e25623e7

Nikita Malyavin authored Dec 29, 2020

The assertion failed in handler::ha_reset upon SELECT under
READ UNCOMMITTED from table with index on virtual column.

This was the debug-only failure, though the problem is mush wider:
* MY_BITMAP is a structure containing my_bitmap_map, the latter is a raw
 bitmap.
* read_set, write_set and vcol_set of TABLE are the pointers to MY_BITMAP
* The rest of MY_BITMAPs are stored in TABLE and TABLE_SHARE
* The pointers to the stored MY_BITMAPs, like orig_read_set etc, and
 sometimes all_set and tmp_set, are assigned to the pointers.
* Sometimes tmp_use_all_columns is used to substitute the raw bitmap
 directly with all_set.bitmap
* Sometimes even bitmaps are directly modified, like in
TABLE::update_virtual_field(): bitmap_clear_all(&tmp_set) is called.

The last three bullets in the list, when used together (which is mostly
always) make the program flow cumbersome and impossible to follow,
notwithstanding the errors they cause, like this MDEV-17556, where tmp_set
pointer was assigned to read_set, write_set and vcol_set, then its bitmap
was substituted with all_set.bitmap by dbug_tmp_use_all_columns() call,
and then bitmap_clear_all(&tmp_set) was applied to all this.

To untangle this knot, the rule should be applied:
* Never substitute bitmaps! This patch is about this.
 orig_*, all_set bitmaps are never substituted already.

This patch changes the following function prototypes:
* tmp_use_all_columns, dbug_tmp_use_all_columns
 to accept MY_BITMAP** and to return MY_BITMAP * instead of my_bitmap_map*
* tmp_restore_column_map, dbug_tmp_restore_column_maps to accept
 MY_BITMAP* instead of my_bitmap_map*

These functions now will substitute read_set/write_set/vcol_set directly,
and won't touch underlying bitmaps.

e25623e7

06 Jan, 2021 1 commit
- MDEV-19442 add-on · f319c426
  Andrei Elkin authored Jan 06, 2021
```
fixing windows build.
```
  f319c426
04 Jan, 2021 7 commits

MDEV-24482: Added wait condition to make sure table t1 is replicated to node_2. · 51b7438d
Stepan Patryshev authored Dec 30, 2020

51b7438d
MDEV-24465: Added wait condition to make sure table t1 is replicated to node_2. · 06644f70
Stepan Patryshev authored Dec 30, 2020

06644f70
MDEV-24464: Added wait condition to make sure table t1 is replicated to node_2. · 1284e6c3
Stepan Patryshev authored Dec 30, 2020

1284e6c3
MDEV-24447: Added wait condition to make sure table t1 is replicated to node_2. · 9de9e0c7
Stepan Patryshev authored Dec 30, 2020

9de9e0c7
MDEV-24462: Added wait condition to make sure table t1 is replicated to node_2. · cd529ae8
Stepan Patryshev authored Dec 30, 2020

cd529ae8

MDEV-23033: All slaves crash once in ~24 hours and loop restart with signal 11 · 608b0ee5

Sujatha authored Dec 31, 2020

Problem:
=======
Upon deleting or updating a row in a parent table (with primary key), if
the child table has virtual column and an associated key with ON UPDATE
CASCADE/ON DELETE CASCADE, it will result in slave crash.

Analysis:
========
Tables which are related through foreign key require prelocking similar to
triggers. i.e If a table has triggers/foreign keys we should add all tables
and routines used by them to the prelocking set.  This prelocking happens
during 'open_and_lock_tables' call.  Each table being opened is checked for
foreign key references. If foreign key reference exists then the child
table is opened and it is linked to the table_list. Upon any modification
to  parent table its corresponding child tables are retried from table_list
and they are updated accordingly. This prelocking work fine on master.

On slave  prelocking works for following cases.
 - Statement/mixed based replication
 - In row based replication when trigger execution is enabled through
   'slave_run_triggers_for_rbr=YES/LOGGING/ENFORCE'

Otherwise it results in an assert/crash, as the parent table will not find
the corresponding child table and it will be NULL. Dereferencing NULL
pointer leads to slave server exit.

Fix:
===
Introduce a new 'slave_fk_event_map' flag similar to 'trg_event_map'. This
flag will ensure that when foreign key is enabled in row based replication
all the parent and child tables are prelocked, so that parent is able to
locate the child table.

Note: This issue is specific to slave, hence only slave needs to be
      upgraded.

608b0ee5

MDEV-23875 is failing to build on windows. · 25db9ffa
Rucha Deodhar authored Jan 04, 2021

25db9ffa

31 Dec, 2020 1 commit

MDEV-23875: select into outfile not respect UMASK and UMASK_DIR · 4f5d5a78

Rucha Deodhar authored Dec 28, 2020

Analysis: select into outfile creates files everytime with 666 permission,
regardsless if umask environment variables and umask settings on OS level.
It seems hardcoded.
Fix: change 0666 to 0644 which will let anybody consume the file but not
change it.

4f5d5a78

28 Dec, 2020 3 commits

MDEV-19442 server_audit plugin doesn't consider proxy users in... · 78292047

Alexey Botchkov authored Dec 28, 2020

MDEV-19442 server_audit plugin doesn't consider proxy users in server_audit_excl_users/server_audit_incl_users.

Check the proxy user just as the connection user against the
incl_users_list and excl_users_list.

78292047

MDEV-24449 Corruption of system tablespace or last recovered page · 5b9ee8d8

Marko Mäkelä authored Dec 28, 2020

This corresponds to 10.5 commit 39378e13.

With a patched version of the test innodb.ibuf_not_empty (so that
it would trigger crash recovery after using the change buffer),
and patched code that would modify the os_thread_sleep() in
recv_apply_hashed_log_recs() to be 1ms as well as add a sleep of
the same duration to the end of recv_recover_page() when
recv_sys->n_addrs=0, we can demonstrate a race condition.

After disabling some debug checks in buf_all_freed_instance(),
buf_pool_invalidate_instance() and buf_validate(), we managed to
trigger an assertion failure in fseg_free_step(), on the XDES_FREE_BIT.
In other words, an trx_undo_seg_free() call during
trx_rollback_resurrected() was attempting a double-free of a page.
This was repeated about once in 400 to 500 test runs. With the fix
applied, the test passed 2,000 runs.

recv_apply_hashed_log_recs(): Do not only wait for recv_sys->n_addrs
to reach 0, but also wait for buf_get_n_pending_read_ios() to reach 0,
to guarantee that buf_page_io_complete() will not be executing
ibuf_merge_or_delete_for_page().

5b9ee8d8

MDEV-23851 MDEV-24229 BF-BF conflict issues · 8e3e87d2

sjaakola authored Dec 08, 2020

Issues MDEV-23851 and MDEV-24229 are probably duplicates and are caused by the new self-asserting function lock0lock.cc:wsrep_assert_no_bf_bf_wait().
The criteria for asserting is too strict and does not take in consideration scenarios of "false positive" lock conflicts, which are resolved by replaying the local transaction.
As a fix, this PR is relaxing the assert criteria by two conditions, which skip assert if high priority transactions are locking in correct order or if conflicting high priority lock holder is aborting and has just not yet released the lock.

Alternative fix would be to remove wsrep_assert_no_bf_bf_wait() altogether, or remove the assert in this function and let it only print warnings in error log.
But in my high conflict rate multi-master test scenario, this relaxed asserting appears to be safe.

This PR also removes two wsrep_report_bf_lock_wait() calls in innodb lock manager, which cause mutex access assert in debug builds.

Foreign key appending missed handling of data types of float and double in INSERT execution. This is not directly related to the actual issue here but is fixed in this PR nevertheless. Missing these foreign keys values in certification could cause problems in some multi-master load scenarios.

Finally, some problem reports suggest that some of the issues reported in MDEV-23851 might relate to false positive lock conflicts over unique secondary index gaps. There is separate work for relaxing UK index gap locking of replication appliers, and separate PR will be submitted for it, with a related mtr test as well.

8e3e87d2

24 Dec, 2020 1 commit
- Fix MDEV-21958 code to be working with not 64 MAX_INDEXES · 1e9af799
  Oleksandr Byelkin authored Dec 24, 2020
  
  1e9af799
22 Dec, 2020 2 commits
- Forgot to add this change to previous cset · 8d8370e3
  Sergei Petrunia authored Dec 22, 2020
  
  8d8370e3
- MDEV-24444: ASAN use-after-poison in Item_func_in::get_func_mm_tree with NOT IN · df4f4bd8
  Sergei Petrunia authored Dec 22, 2020
```
Fix a trivial error in the fix for MDEV-21958: check the key in the right
table.
```
  df4f4bd8
19 Dec, 2020 6 commits

MDEV-22630 mysql_upgrade (MariaDB 5.2.X --> MariaDB 10.3.X) does not fix... · dfe8ef8b

Sergei Golubchik authored Apr 21, 2020

MDEV-22630 mysql_upgrade (MariaDB 5.2.X --> MariaDB 10.3.X) does not fix auth_string to change it to authentication_string

cherry-pick from 10.4:

  commit b976b9bf
  Author: Sergei Golubchik <serg@mariadb.com>
  Date:   Tue Apr 21 18:40:15 2020 +0200

    MDEV-21244 mysql_upgrade creating empty global_priv table

    support upgrades from 5.2 privilege tables

dfe8ef8b

Item_func_like::walk() was ignoring escape_item · 6f40d5c8

Sergei Golubchik authored Dec 16, 2020

in particular, it caused escape_item->is_expensive() property
to be lost instead of being properly propagated up.

6f40d5c8

MDEV-24346 valgrind error in main.precedence · 59211ab7

Sergei Golubchik authored Dec 15, 2020

Part II.

It's still possible to bypass Item_func_like::escape
initialization in Item_func_like::fix_fields().

This requires ESCAPE argument being a cacheable subquery
that uses tables and is inside a derived table which
is used in multi-update.

Instead of implementing a complex or expensive fix for
this particular ridiculously artificial case, let's simply disallow it.

59211ab7

MDEV-24346 valgrind error in main.precedence · a587ded2

Sergei Golubchik authored Dec 14, 2020

in queries like

  create view v1 as select 2 like 1 escape (3 in (select 0 union select 1));
  select 2 union select * from v1;

Item_func_like::escape was left uninitialized, because
Item_in_optimizer is const_during_execution()
but not actually const_item() during execution.

It's not, because const subquery evaluation was disabled for derived.
Practically it only needs to be disabled for multi-update
that runs fix_fields() before all tables are locked.

a587ded2

Item_func_like calls escape_item->fix_fields() twice · 5785de72

Sergei Golubchik authored Dec 16, 2020

this happens if Item_func_like is copied (get_copy()).
after one copy gets fixed, the other tries to fix escape item again.

5785de72

MDEV-23065 : Crash after setting wsrep_on to ON dynamically and reconnect · d1e9a4c1
Jan Lindström authored Dec 19, 2020
```
At end_connection make sure we have wsrep before trying to free
connection assigned to it.
```
d1e9a4c1

18 Dec, 2020 2 commits

MDEV-22008 rpl.rpl_semi_sync fails in bb, MDEV-24418 reenable... · 4e43e2f9

Alice Sherepa authored Dec 03, 2020

MDEV-22008 rpl.rpl_semi_sync fails in bb, MDEV-24418 reenable binlog_truncate_innodb and binlog_spurious_ddl_errors, rpl_parallel_retry fails in bb

4e43e2f9

MDEV-24041 Generated column DELETE with FOREIGN KEY crash InnoDB · 83d2e084

Nikita Malyavin authored Dec 17, 2020

row_upd_clust_step() calls row_upd_del_mark_clust_rec() which would
allocate some memory in row_ins_foreign_fill_virtual(). Then,
row_upd_store_row() would access the allocated memory, but only after
potentially freeing that memory by invoking mem_heap_empty(),
leading to ASAN heap-use-after-free diagnostics.

row_ins_foreign_fill_virtual(): Use a more appropriate memory heap with a
longer lifetime.

83d2e084

17 Dec, 2020 2 commits

MDEV-20751 Permission Issue With Nested CTEs · 25d6f634

Igor Babaev authored Dec 17, 2020

Due to this bug the server reported bogus messages about lack of SELECT
privileges for base tables used in the specifications of CTE tables.
It happened only if such a CTE were referred to at least twice.
For any non-recursive reference to CTE that is not primary the
specification of the CTE is cloned. The function check_table_access() is
called for such reference. The function checks privileges of the tables
referenced in the specification. As no name resolution was performed for
CTE references whose definitions occurred outside the specification before
the call of check_table_access() that was supposed to check the access
rights of the underlying tables these references were considered
as references to base tables rather than references to CTEs. Yet for CTEs
as well as for derived tables no privileges are needed and thus cannot
be granted.
The patch ensures proper name resolution of all references to CTEs before
any acl checks.

Approved by Oleksandr Byelkin <sanja@mariadb.com>

25d6f634

MDEV-24327 wsrep XID checkpointing order with log_slave_updates=OFF · 2cb5fb60

sjaakola authored Dec 02, 2020

If log_slave_updates==OFF, wsrep applier threads used to be configured
with option: thd->variables.option_bits&= ~(OPTION_BIN_LOG);
(i.e. like sql_log_bin=ON). And this was regardless of log-bin configuration.

With this, having configuration of: --log-bin && --log-slave-updates=OFF,
local threads used binlogging, but applier threads did not. And further:
local threads went through binlog group commit, while applier threads did
direct commits. This resulted in situation, where applier threads entered
earlier in wsrep XID checkpointing, and could sync their wsrep XID out of order.
Later local thread commit would see that higher seqno was already checkpointed,
and fire an assert because of this.

As a fix, applier threads are now forced to enable binlogging regardless of
log-slave-updates configuration.

This PR comes with new mtr test: galera.MDEV-24327, which causes a scenario
where applier transaction is applied and committed while earlier local transaction
is parked before commit order monitor enter. A buggy mariadb versoin would fail
for assertion because of wsrep XID checkpoint order violation.
Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>

2cb5fb60

16 Dec, 2020 2 commits

MDEV-23406 Signal 8 in maria_create after recursive cte query · a244be70

Igor Babaev authored Dec 16, 2020

This bug could cause a crash when executing queries that used mutually
recursive CTEs with system variable big_tables set to 1. It happened due
to several bugs in the code that handled recursive table references
referred mutually recursive CTEs. For each recursive table reference a
temporary table is created that contains all rows generated for the
corresponding recursive CTE table on the previous step of recursion.
This temporary table should be created in the same way as the temporary
table created for a regular materialized derived table using the
method select_union::create_result_table(). In this case when the
temporary table is created it uses the select_union::TMP_TABLE_PARAM
structure as the parameter for the table construction. However the
code created the temporary table using just the function create_tmp_table()
and passed pointers to certain fields of the TMP_TABLE_PARAM structure
used for accumulation of rows of the recursive CTE table as parameters
for update. This was a mistake because now different temporary tables
cannot share some TMP_TABLE_PARAM fields in a general case. Besides,
depending on how mutually recursive CTE tables were defined and which
of them were referred in the executed query the select_union object
allocated for a recursive table reference could be allocated again after
the the temporary table had been created. In this case the TMP_TABLE_PARAM
object associated with the temporary table created for the recursive
table reference contained unassigned fields needed for execution when
Aria engine is employed as the engine for temporary tables.
This patch ensures that
- select_union object is created only once for any recursive table
  reference
- any temporary table created for recursive CTEs uses its own
  TMP_TABLE_PARAM structure
The patch also fixes a problem caused by incomplete cleanup of join tables
associated with recursive table references.

Approved by Oleksandr Byelkin <sanja@mariadb.com>

a244be70

MDEV-22810 mariabackup does not honor open_files_limit from option during backup prepare · 719da2c4
Vlad Lesin authored Dec 15, 2020
```
open_files_limit option was processed only for --backup, but not for
--prepare.
```
719da2c4

15 Dec, 2020 5 commits

MDEV-21958: postfix - result of range_mrr_icp · aebb1112
Daniel Black authored Dec 16, 2020

aebb1112

MDEV-24172: innodb stats table last_update is TIMESTAMP · 2c4761cc

Daniel Black authored Nov 09, 2020

The last_updated column of innodb_table_stats and innodb_index_stats
hasn't been DATA_FIXBINARY for many years.

Innodb represents TIMESTAMP as INT of length 4. Let's test it with this
and stop hiding the result in mysql_upgrade test.

Reviewer: Marko

2c4761cc

MDEV-24414 Update and enable galera.galera_defaults · dc62a67e
Stepan Patryshev authored Dec 15, 2020

dc62a67e

MDEV-21958: Query having many NOT-IN clauses running forever · 066212d1

Sergei Petrunia authored Dec 15, 2020

Basic variant of the fix: do not consider conditions in form

  unique_key NOT IN (c1,c2...)

to be sargable. If there are only a few constants, the condition
is not selective. If there are a lot constants, the overhead of
processing such a huge range list is not worth it.

(Backport to 10.2)

066212d1

MDEV-24034 Policy CMP0075 is not set during compile · ac9c6f53

Vladislav Vaintroub authored Oct 27, 2020

The policy is not set for 10.2
If it is set, CMake would complain about bundled zlib for which the policy
is not set.

Fix:
- Set policy for 10.2 for the top level project.
For 10.3+ it was already set

- Cleanup zlib to remove unneeded stuff. It is an internal static library,
it needs none of PROJECT, library versioning, RC file on Windows.
The name of the library on Unix does not make any difference, since it is
static and compiled in.

ac9c6f53