1. 27 Mar, 2020 10 commits
    • Alexander Barkov's avatar
    • Marko Mäkelä's avatar
      MDEV-22060 MSAN use-of-uninitialized-value in main.query_cache_innodb · d3bdc30c
      Marko Mäkelä authored
      During the test main.query_cache_innodb, only 16 bytes of
      db_buf are initialized during the memcmp() in
      dict_acquire_mdl_shared<false>(), but db_len was wrongly set to 20 bytes.
      
      Something similar was fixed in MDEV-21344, but only for the table name,
      in commit 0e25a8b4.
      
      dict_table_t::parse_name(): Assign the return value of
      filename_to_tablename() to the output parameters for lengths.
      There is no need to invoke strlen().
      d3bdc30c
    • Andrew Hutchings's avatar
      MDEV-20329 Fix S3 engine OpenSSL race · 0181384a
      Andrew Hutchings authored
      With OpenSSL < 1.1 there is a potential for a race condition to occur.
      This can cause the S3 engine to crash. The workaround is to add locking
      callbacks to OpenSSL so that this doesn't happen.
      
      https://curl.haxx.se/libcurl/c/threadsafe.html
      
      There is a fix in libMariaS3 for this which when a certain flag is set
      (HAVE_CURL_OPENSSL_UNSAFE) will add the required locks.
      
      This patch adds CMake support so that the flag is set if it is found
      that Curl is compiled with an unsafe OpenSSL version. For example Ubuntu
      16.04 with libcurl4-openssl-dev.
      0181384a
    • Monty's avatar
      Fixed failing tests in buildbot · ff64152b
      Monty authored
      - Updated icp_tests.inc and result files
      - Updates usage of records_in_range() protype in cassandra
      ff64152b
    • Marko Mäkelä's avatar
      Merge 10.4 into 10.5 · 53aabda6
      Marko Mäkelä authored
      53aabda6
    • Marko Mäkelä's avatar
      MDEV-21899 INSERT into a secondary index with zero-data-length key is not crash-safe · f614b6ea
      Marko Mäkelä authored
      page_cur_insert_rec_low(): Remove a bogus condition that wrongly
      omitted redo logging when the record contains no data payload bytes.
      We can have such records in secondary indexes, when the values of
      the PRIMARY KEY column(s) are the empty string, and the values of
      secondary key column(s) are are NULL or the empty string.
      
      page_apply_delete_dynamic(): Improve the consistency check, and
      do not allow adjacent records to be less than 5 bytes apart from
      each other. The fixed-size part of the record header is 5 bytes.
      Usually there must also be some header or payload bytes, but in
      an extreme case where all columns are CHAR(0) NOT NULL, the
      minimum secondary index record size is 5 bytes, and the table can
      contain at most 1 row. The minimum clustered index record size is
      5+6+7 bytes (header, DB_TRX_ID, DB_ROLL_PTR) or x+5+4 bytes
      (fixed-size header, child page number, and some additional header
      or payload bytes).
      f614b6ea
    • Monty's avatar
      Updated optimizer costs in multi_range_read_info_const() and sql_select.cc · eb483c51
      Monty authored
      - multi_range_read_info_const now uses the new records_in_range interface
      - Added handler::avg_io_cost()
      - Don't calculate avg_io_cost() in get_sweep_read_cost if avg_io_cost is
        not 1.0.  In this case we trust the avg_io_cost() from the handler.
      - Changed test_quick_select to use TIME_FOR_COMPARE instead of
        TIME_FOR_COMPARE_IDX to align this with the rest of the code.
      - Fixed bug when using test_if_cheaper_ordering where we didn't use
        keyread if index was changed
      - Fixed a bug where we didn't use index only read when using order-by-index
      - Added keyread_time() to HEAP.
        The default keyread_time() was optimized for blocks and not suitable for
        HEAP. The effect was the HEAP prefered table scans over ranges for btree
        indexes.
      - Fixed get_sweep_read_cost() for HEAP tables
      - Ensure that range and ref have same cost for simple ranges
        Added a small cost (MULTI_RANGE_READ_SETUP_COST) to ranges to ensure
        we favior ref for range for simple queries.
      - Fixed that matching_candidates_in_table() uses same number of records
        as the rest of the optimizer
      - Added avg_io_cost() to JT_EQ_REF cost. This helps calculate the cost for
        HEAP and temporary tables better. A few tests changed because of this.
      - heap::read_time() and heap::keyread_time() adjusted to not add +1.
        This was to ensure that handler::keyread_time() doesn't give
        higher cost for heap tables than for normal tables. One effect of
        this is that heap and derived tables stored in heap will prefer
        key access as this is now regarded as cheap.
      - Changed cost for index read in sql_select.cc to match
        multi_range_read_info_const(). All index cost calculation is now
        done trough one function.
      - 'ref' will now use quick_cost for keys if it exists. This is done
        so that for '=' ranges, 'ref' is prefered over 'range'.
      - scan_time() now takes avg_io_costs() into account
      - get_delayed_table_estimates() uses block_size and avg_io_cost()
      - Removed default argument to test_if_order_by_key(); simplifies code
      eb483c51
    • Monty's avatar
      Removed double calls to records_in_range from distinct and group by · b3ab3105
      Monty authored
      Fixed by moving testing of get_best_group_min_max() after range testing.
      b3ab3105
    • Monty's avatar
      Added page_range to records_in_range() to improve range statistics · f36ca142
      Monty authored
      Prototype change:
      -  virtual ha_rows records_in_range(uint inx, key_range *min_key,
      -                                   key_range *max_key)
      +  virtual ha_rows records_in_range(uint inx, const key_range *min_key,
      +                                   const key_range *max_key,
      +                                   page_range *res)
      
      The handler can ignore the page_range parameter. In the case the handler
      updates the parameter, the optimizer can deduce the following:
      - If previous range's last key is on the same block as next range's first
        key
      - If the current key range is in one block
      - We can also assume that the first and last block read are cached!
        This can be used for a better calculation of IO seeks when we
        estimate the cost of a range index scan.
      
      The parameter is fully implemented for MyISAM, Aria and InnoDB.
      A separate patch will update handler::multi_range_read_info_const() to
      take the benefits of this change and also remove the double
      records_in_range() calls that are not anymore needed.
      f36ca142
    • Monty's avatar
  2. 26 Mar, 2020 6 commits
  3. 25 Mar, 2020 12 commits
  4. 24 Mar, 2020 12 commits
    • Alexander Barkov's avatar
      MDEV-22030 Don't grant REPLICATION MASTER ADMIN automatically on upgrade from... · 19e998d2
      Alexander Barkov authored
      MDEV-22030 Don't grant REPLICATION MASTER ADMIN automatically on upgrade from an older JSON user table
      19e998d2
    • Alexander Barkov's avatar
      An additional test for MDEV-21743 Split up SUPER privilege to smaller privileges · 30cacf3f
      Alexander Barkov authored
      Adding a test to check that having a user with REPLICATION SLAVE
      privilege is enough to run replication.
      
      Made by Serg.
      30cacf3f
    • Monty's avatar
    • Monty's avatar
      3faa15cf
    • Monty's avatar
      Added FLUSH THREADS · 6736f152
      Monty authored
      6736f152
    • Monty's avatar
      Replace handler::primary_key_is_clustered() with handler::pk_is_clustering_key() · 37393bea
      Monty authored
      This was done to both simplify the code and also to be easier to handle
      storage engines that are clustered on some other index than the primary
      key.
      
      As pk_is_clustering_key() and is_clustering_key now are using only
      index_flags, these where removed from all storage engines.
      37393bea
    • Monty's avatar
      mysqld --help will now load mysqld.options table · 8eba777c
      Monty authored
      Changes:
      - Initalize Aria early to allow it to load mysql.plugin table with --help
      - Don't print 'aborting' when doing --help
      - Don't write 'loose' error messages on log_warning < 2 (2 is default)
      - Don't write warnings about disabled plugings when doing --help
      - Don't write aria_log_control or aria log files when doing --help
      - When using --help, open all Aria tables in read only mode (safety)
      - If aria_init() fails, do a cleanup(). (Frees used memory)
      - If aria_log_control is locked with --help, then don't wait 30 seconds
        but instead return at once without initialzing Aria plugin.
      8eba777c
    • Michael Widenius's avatar
    • Monty's avatar
      Fixed typo in aria_s3_copy · 44790f58
      Monty authored
      44790f58
    • Monty's avatar
      3775e7cc
    • Monty's avatar
      Speed up writing to encrypted binlogs · 120b73a0
      Monty authored
      MDEV-21604
      
      Added "virtual" low level write function encrypt_or_write that is set
      to point to either normal or encrypted write functions.
      
      This patch also fixes a possible memory leak if writing to binary log fails.
      120b73a0
    • Monty's avatar
      Clean up and speed up interfaces for binary row logging · 91ab42a8
      Monty authored
      MDEV-21605 Clean up and speed up interfaces for binary row logging
      MDEV-21617 Bug fix for previous version of this code
      
      The intention is to have as few 'if' as possible in ha_write() and
      related functions. This is done by pre-calculating once per statement the
      row_logging state for all tables.
      
      Benefits are simpler and faster code both when binary logging is disabled
      and when it's enabled.
      
      Changes:
      - Added handler->row_logging to make it easy to check it table should be
        row logged. This also made it easier to disabling row logging for system,
        internal and temporary tables.
      - The tables row_logging capabilities are checked once per "statements
        that updates tables" in THD::binlog_prepare_for_row_logging() which
        is called when needed from THD::decide_logging_format().
      - Removed most usage of tmp_disable_binlog(), reenable_binlog() and
        temporary saving and setting of thd->variables.option_bits.
      - Moved checks that can't change during a statement from
        check_table_binlog_row_based() to check_table_binlog_row_based_internal()
      - Removed flag row_already_logged (used by sequence engine)
      - Moved binlog_log_row() to a handler::
      - Moved write_locked_table_maps() to THD::binlog_write_table_maps() as
        most other related binlog functions are in THD.
      - Removed binlog_write_table_map() and binlog_log_row_internal() as
        they are now obsolete as 'has_transactions()' is pre-calculated in
        prepare_for_row_logging().
      - Remove 'is_transactional' argument from binlog_write_table_map() as this
        can now be read from handler.
      - Changed order of 'if's in handler::external_lock() and wsrep_mysqld.h
        to first evaluate fast and likely cases before more complex ones.
      - Added error checking in ha_write_row() and related functions if
        binlog_log_row() failed.
      - Don't clear check_table_binlog_row_based_result in
        clear_cached_table_binlog_row_based_flag() as it's not needed.
      - THD::clear_binlog_table_maps() has been replaced with
        THD::reset_binlog_for_next_statement()
      - Added 'MYSQL_OPEN_IGNORE_LOGGING_FORMAT' flag to open_and_lock_tables()
        to avoid calculating of binary log format for internal opens. This flag
        is also used to avoid reading statistics tables for internal tables.
      - Added OPTION_BINLOG_LOG_OFF as a simple way to turn of binlog temporary
        for create (instead of using THD::sql_log_bin_off.
      - Removed flag THD::sql_log_bin_off (not needed anymore)
      - Speed up THD::decide_logging_format() by remembering if blackhole engine
        is used and avoid a loop over all tables if it's not used
        (the common case).
      - THD::decide_logging_format() is not called anymore if no tables are used
        for the statement. This will speed up pure stored procedure code with
        about 5%+ according to some simple tests.
      - We now get annotated events on slave if a CREATE ... SELECT statement
        is transformed on the slave from statement to row logging.
      - In the original code, the master could come into a state where row
        logging is enforced for all future events if statement could be used.
        This is now partly fixed.
      
      Other changes:
      - Ensure that all tables used by a statement has query_id set.
      - Had to restore the row_logging flag for not used tables in
        THD::binlog_write_table_maps (not normal scenario)
      - Removed injector::transaction::use_table(server_id_type sid, table tbl)
        as it's not used.
      - Cleaned up set_slave_thread_options()
      - Some more DBUG_ENTER/DBUG_RETURN, code comments and minor indentation
        changes.
      - Ensure we only call THD::decide_logging_format_low() once in
        mysql_insert() (inefficiency).
      - Don't annotate INSERT DELAYED
      - Removed zeroing pos_in_table_list in THD::open_temporary_table() as it's
        already 0
      91ab42a8