TiDB 6.5.0 Release Notes
Release date: December 29, 2022
TiDB version: 6.5.0
Quick access: Quick start | Production deployment | Installation packages
TiDB 6.5.0 is a Long-Term Support Release (LTS).
Compared with TiDB 6.4.0-DMR, TiDB 6.5.0 introduces the following key features and improvements:
- The index acceleration feature becomes generally available (GA), which improves the performance of adding indexes by about 10 times compared with v6.1.0.
- The TiDB global memory control becomes GA, and you can control the memory consumption threshold via
tidb_server_memory_limit
. - The high-performance and globally monotonic
AUTO_INCREMENT
column attribute becomes GA, which is compatible with MySQL. FLASHBACK CLUSTER TO TIMESTAMP
is now compatible with TiCDC and PITR and becomes GA.- Enhance TiDB optimizer by making the more accurate Cost Model version 2 generally available and supporting expressions connected by
AND
for INDEX MERGE. - Support pushing down the
JSON_EXTRACT()
function to TiFlash. - Support password management policies that meet password compliance auditing requirements.
- TiDB Lightning and Dumpling support importing and exporting compressed SQL and CSV files.
- TiDB Data Migration (DM) continuous data validation becomes GA.
- TiDB Backup & Restore supports snapshot checkpoint backup, improves the recovery performance of PITR by 50%, and reduces the RPO in common scenarios to as short as 5 minutes.
- Improve the TiCDC throughput of replicating data to Kafka from 4000 rows/s to 35000 rows/s, and reduce the replication latency to 2s.
- Provide row-level Time to live (TTL) to manage data lifecycle (experimental).
- TiCDC supports replicating changed logs to object storage such as Amazon S3, Azure Blob Storage, and NFS (experimental).
New features
SQL
The performance of TiDB adding indexes is improved by about 10 times (GA) #35983 @benjamin2037 @tangenta
TiDB v6.3.0 introduces the Add index acceleration as an experimental feature to improve the speed of backfilling when creating an index. In v6.5.0, this feature becomes GA and is enabled by default, and the performance on large tables is expected to be about 10 times faster than that in v6.1.0. The acceleration feature is suitable for scenarios where a single SQL statement adds an index serially. When multiple SQL statements add indexes in parallel, only one of the SQL statements will be accelerated.
Provide lightweight metadata lock to improve the DML success rate during DDL change (GA) #37275 @wjhuang2016
TiDB v6.3.0 introduces Metadata lock as an experimental feature. To avoid the
Information schema is changed
error caused by DML statements, TiDB coordinates the priority of DMLs and DDLs during table metadata change, and makes the ongoing DDLs wait for the DMLs with old metadata to commit. In v6.5.0, this feature becomes GA and is enabled by default. It is suitable for various types of DDLs change scenarios. When you upgrade your existing cluster from versions earlier than v6.5.0 to v6.5.0 or later, TiDB automatically enables metadata lock. To disable this feature, you can set the system variabletidb_enable_metadata_lock
toOFF
.For more information, see documentation.
Support restoring a cluster to a specific point in time by using
FLASHBACK CLUSTER TO TIMESTAMP
(GA) #37197 #13303 @Defined2014 @bb7133 @JmPotato @Connor1996 @HuSharp @CalvinNeoSince v6.4.0, TiDB has introduced the
FLASHBACK CLUSTER TO TIMESTAMP
statement as an experimental feature. You can use this statement to restore a cluster to a specific point in time within the Garbage Collection (GC) life time. In v6.5.0, this feature is now compatible with TiCDC and PITR and becomes GA. This feature helps you to easily undo DML misoperations, restore the original cluster in minutes, and roll back data at different time points to determine the exact time when data changes.For more information, see documentation.
Fully support non-transactional DML statements including
INSERT
,REPLACE
,UPDATE
, andDELETE
#33485 @ekexiumIn the scenarios of large data processing, a single SQL statement with a large transaction might have a negative impact on the cluster stability and performance. A non-transactional DML statement is a DML statement split into multiple SQL statements for internal execution. The split statements compromise transaction atomicity and isolation but greatly improve the cluster stability. TiDB has supported non-transactional
DELETE
statements since v6.1.0, and non-transactionalINSERT
,REPLACE
, andUPDATE
statements since v6.5.0.For more information, see Non-Transactional DML statements and
BATCH
syntax.Support time to live (TTL) (experimental) #39262 @lcwangchao
TTL provides row-level data lifetime management. In TiDB, a table with the TTL attribute automatically checks data lifetime and deletes expired data at the row level. TTL is designed to help you clean up unnecessary data periodically and in a timely manner without affecting the online read and write workloads.
For more information, see documentation.
Support saving TiFlash query results using the
INSERT INTO SELECT
statement (experimental) #37515 @gengliqiStarting from v6.5.0, TiDB supports pushing down the
SELECT
clause (analytical query) of theINSERT INTO SELECT
statement to TiFlash. In this way, you can easily save the TiFlash query result to a TiDB table specified byINSERT INTO
for further analysis, which takes effect as result caching (that is, result materialization). For example:INSERT INTO t2 SELECT Mod(x,y) FROM t1;During the experimental phase, this feature is disabled by default. To enable it, you can set the
tidb_enable_tiflash_read_for_write_stmt
system variable toON
. There are no special restrictions on the result table specified byINSERT INTO
for this feature, and you are free to add or not add a TiFlash replica to that result table. Typical usage scenarios of this feature include:- Run complex analytical queries using TiFlash
- Reuse TiFlash query results or deal with highly concurrent online requests
- Need a relatively small result set compared with the input data size, preferably smaller than 100 MiB.
For more information, see documentation.
Support binding history execution plans (experimental) #39199 @fzzf678
For a SQL statement, due to various factors during execution, the optimizer might occasionally choose a new execution plan instead of its previous optimal execution plan, and the SQL performance is impacted. In this case, if the optimal execution plan has not been cleared yet, it still exists in the SQL execution history.
In v6.5.0, TiDB supports binding historical execution plans by extending the binding object in the
CREATE [GLOBAL | SESSION] BINDING
statement. When the execution plan of a SQL statement changes, you can bind the original execution plan by specifyingplan_digest
in theCREATE [GLOBAL | SESSION] BINDING
statement to quickly recover SQL performance, as long as the original execution plan is still in the SQL execution history memory table (for example,statements_summary
). This feature can simplify the process of handling execution plan change issues and improve your maintenance efficiency.For more information, see documentation.
Security
Support the password complexity policy #38928 @CbcWestwolf
After this policy is enabled, when you set a password, TiDB checks the password length, whether uppercase and lowercase letters, numbers, and special characters in the password are sufficient, whether the password matches the dictionary, and whether the password matches the username. This ensures that you set a secure password.
TiDB provides the SQL function
VALIDATE_PASSWORD_STRENGTH()
to validate the password strength.For more information, see documentation.
Support the password expiration policy #38936 @CbcWestwolf
TiDB supports configuring the password expiration policy, including manual expiration, global-level automatic expiration, and account-level automatic expiration. After this policy is enabled, you must change your passwords periodically. This reduces the risk of password leakage due to long-term use and improves password security.
For more information, see documentation.
Support the password reuse policy #38937 @keeplearning20221
TiDB supports configuring the password reuse policy, including global-level password reuse policy and account-level password reuse policy. After this policy is enabled, you cannot use the passwords that you have used within a specified period or the most recent several passwords that you have used. This reduces the risk of password leakage due to repeated use of passwords and improves password security.
For more information, see documentation.
Support failed-login tracking and temporary account locking policy #38938 @lastincisor
After this policy is enabled, if you log in to TiDB with incorrect passwords multiple times consecutively, the account is temporarily locked. After the lock time ends, the account is automatically unlocked.
For more information, see documentation.
Observability
TiDB Dashboard can be deployed on Kubernetes as an independent Pod #1447 @SabaPing
TiDB v6.5.0 (and later) and TiDB Operator v1.4.0 (and later) support deploying TiDB Dashboard as an independent Pod on Kubernetes. Using TiDB Operator, you can access the IP address of this Pod to start TiDB Dashboard.
Independently deploying TiDB Dashboard provides the following benefits:
- The compute work of TiDB Dashboard does not pose pressure on PD nodes. This ensures more stable cluster operation.
- The user can still access TiDB Dashboard for diagnosis even if the PD node is unavailable.
- Accessing TiDB Dashboard on the internet does not involve the privileged interfaces of PD. Therefore, the security risk of the cluster is reduced.
For more information, see documentation.
Performance Overview dashboard adds TiFlash and CDC (Change Data Capture) panels #39230 @dbsid
Since v6.1.0, TiDB has introduced the Performance Overview dashboard in Grafana, which provides a system-level entry for overall performance diagnosis of TiDB, TiKV, and PD. In v6.5.0, the Performance Overview dashboard adds TiFlash and CDC panels. With these panels, starting from v6.5.0, you can use the Performance Overview dashboard to analyze the performance of all components in a TiDB cluster.
The TiFlash and CDC panels reorganize the TiFlash and TiCDC monitoring information, which can help you greatly improve the efficiency of analyzing and troubleshooting TiFlash and TiCDC performance issues.
- On the TiFlash panels, you can easily view the request types, latency analysis, and resource usage overview of your TiFlash cluster.
- On the CDC panels, you can easily view the health, replication latency, data flow, and downstream write latency of your TiCDC cluster.
For more information, see documentation.
Performance
INDEX MERGE supports expressions connected by
AND
#39333 @guo-shaoge @time-and-fate @hailanwhuBefore v6.5.0, TiDB only supported using index merge for the filter conditions connected by
OR
. Starting from v6.5.0, TiDB has supported using index merge for filter conditions connected byAND
in theWHERE
clause. In this way, the index merge of TiDB can now cover more general combinations of query filter conditions and is no longer limited to union (OR
) relationship. The current v6.5.0 version only supports index merge underOR
conditions as automatically selected by the optimizer. To enable index merge forAND
conditions, you need to use theUSE_INDEX_MERGE
hint.For more details about index merge, see v5.4.0 Release Notes and Explain Index Merge.
Support pushing down the following JSON functions to TiFlash #39458 @yibin87
->
->>
JSON_EXTRACT()
The JSON format provides a flexible way for application data modeling. Therefore, more and more applications are using the JSON format for data exchange and data storage. By pushing down JSON functions to TiFlash, you can improve the efficiency of analyzing data in the JSON type and use TiDB for more real-time analytics scenarios.
Support pushing down the following string functions to TiFlash #6115 @xzhangxian1008
regexp_like
regexp_instr
regexp_substr
Support the global optimizer hint to interfere with the execution plan generation in Views #37887 @Reminiscent
In some view access scenarios, you need to use optimizer hints to interfere with the execution plan of a query in the view to achieve the optimal performance. Starting from v6.5.0, TiDB supports adding global hints to query blocks in views, making the hints defined in the query effective in the view. This feature provides a way to inject hints into complex SQL statements that contain nested views, enhances the execution plan control, and stabilizes the performance of complex statements. To use global hints, you need to name the query blocks and specify hint references.
For more information, see documentation.
Support pushing down sorting operations of partitioned tables to TiKV #26166 @winoros
Although the partitioned table feature has been GA since v6.1.0, TiDB is continually improving its performance. In v6.5.0, TiDB supports pushing down sorting operations such as
ORDER BY
andLIMIT
to TiKV for computation and filtering, which reduces network I/O overhead and improves SQL performance when you use partitioned tables.Optimizer introduces a more accurate Cost Model Version 2 (GA) #35240 @qw4990
TiDB v6.2.0 introduces the Cost Model Version 2 as an experimental feature. This model uses a more accurate cost estimation method to help the optimizer choose the optimal execution plan. Especially when TiFlash is deployed, Cost Model Version 2 automatically helps choose the appropriate storage engine and avoids much manual intervention. After real-scene testing for a period of time, this model becomes GA in v6.5.0. Starting from v6.5.0, newly created clusters use Cost Model Version 2 by default. For clusters upgrade to v6.5.0, because Cost Model Version 2 might cause changes to query plans, you can set the
tidb_cost_model_version = 2
variable to use the new cost model after sufficient performance testing.Cost Model Version 2 becomes a generally available feature that significantly improves the overall capability of the TiDB optimizer and helps TiDB evolve towards a more powerful HTAP database.
For more information, see documentation.
TiFlash optimizes the operations of getting the number of table rows #37165 @elsa0520
In the scenarios of data analysis, It is a common operation to get the actual number of rows of a table through
COUNT(*)
without filter conditions. In v6.5.0, TiFlash optimizes the rewriting ofCOUNT(*)
and automatically selects the not-null columns with the shortest column definition to count the number of rows, which can effectively reduce the number of I/O operations in TiFlash and improve the execution efficiency of getting row counts.
Stability
The global memory control feature is now GA #37816 @wshwsh12
Since v6.4.0, TiDB has introduced global memory control as an experimental feature. In v6.5.0, it becomes GA and can track the main memory consumption. When the global memory consumption reaches the threshold defined by
tidb_server_memory_limit
, TiDB tries to limit the memory usage by GC or canceling SQL operations, to ensure stability.Note that the memory consumed by the transaction in a session (the maximum value was previously set by the configuration item
txn-total-size-limit
) is now tracked by the memory management module: when the memory consumption of a single session reaches the threshold defined by the system variabletidb_mem_quota_query
, the behavior defined by the system variabletidb_mem_oom_action
will be triggered (the default isCANCEL
, that is, canceling operations). To ensure forward compatibility, whentxn-total-size-limit
is configured as a non-default value, TiDB will still ensure that transactions can use the memory set bytxn-total-size-limit
.If you are using TiDB v6.5.0 or later, it is recommended to remove
txn-total-size-limit
and not to set a separate limit on the memory usage of transactions. Instead, use the system variablestidb_mem_quota_query
andtidb_server_memory_limit
to manage global memory, which can improve the efficiency of memory usage.For more information, see documentation.
Ease of use
Refine the execution information of the TiFlash
TableFullScan
operator in theEXPLAIN ANALYZE
output #5926 @hongyunyanThe
EXPLAIN ANALYZE
statement is used to print execution plans and runtime statistics. In v6.5.0, TiFlash has refined the execution information of theTableFullScan
operator by adding the DMFile-related execution information. Now the TiFlash data scan status information is presented more intuitively, which helps you analyze TiFlash performance more easily.For more information, see documentation.
Support the output of execution plans in the JSON format #39261 @fzzf678
In v6.5.0, TiDB extends the output format of execution plans. By specifying
FORMAT = "tidb_json"
in theEXPLAIN
statement, you can output SQL execution plans in the JSON format. With this capability, SQL debugging tools and diagnostic tools can read execution plans more conveniently and accurately, thus improving the ease of use of SQL diagnosis and tuning.For more information, see documentation.
MySQL compatibility
Support a high-performance and globally monotonic
AUTO_INCREMENT
column attribute (GA) #38442 @tiancaiamaoSince v6.4.0, TiDB has introduced the
AUTO_INCREMENT
MySQL compatibility mode as an experimental feature. This mode introduces a centralized auto-increment ID allocating service that ensures IDs monotonically increase on all TiDB instances. This feature makes it easier to sort query results by auto-increment IDs. In v6.5.0, this feature becomes GA. The insert TPS of a table using this feature is expected to exceed 20,000, and this feature supports elastic scaling to improve the write throughput of a single table and entire clusters. To use the MySQL compatibility mode, you need to setAUTO_ID_CACHE
to1
when creating a table. The following is an example:CREATE TABLE t(a int AUTO_INCREMENT key) AUTO_ID_CACHE 1;For more information, see documentation.
Data migration
Support exporting and importing SQL and CSV files in gzip, snappy, and zstd compression formats #38514 @lichunzhu
Dumpling supports exporting data to compressed SQL and CSV files in these compression formats: gzip, snappy, and zstd. TiDB Lightning also supports importing compressed files in these formats.
Previously, you had to provide large storage space for exporting or importing data to store CSV and SQL files, resulting in high storage costs. With the release of this feature, you can greatly reduce your storage costs by compressing the data files.
For more information, see documentation.
Optimize binlog parsing capability #924 @gmhdbjd
TiDB can filter out binlog events of the schemas and tables that are not in the migration task, thus improving the parsing efficiency and stability. This policy takes effect by default in v6.5.0. No additional configuration is required.
Previously, even if only a few tables were migrated, the entire binlog file upstream had to be parsed. The binlog events of the tables in the binlog file that did not need to be migrated still had to be parsed, which was not efficient. Meanwhile, if these binlog events do not support parsing, the task will fail. By only parsing the binlog events of the tables in the migration task, the binlog parsing efficiency can be greatly improved and the task stability can be enhanced.
Disk quota in TiDB Lightning is GA #446 @buchuitoudegou
You can configure disk quota for TiDB Lightning. When there is not enough disk quota, TiDB Lightning stops reading source data and writing temporary files. Instead, it writes the sorted key-values to TiKV first, and then continues the import process after TiDB Lightning deletes the local temporary files.
Previously, when TiDB Lightning imported data using physical mode, it would create a large number of temporary files on the local disk for encoding, sorting, and splitting the raw data. When your local disk ran out of space, TiDB Lightning would exit with an error due to failing to write to the file. With this feature, TiDB Lightning tasks can avoid overwriting the local disk.
For more information, see documentation.
Continuous data validation in DM is GA #4426 @D3Hunter
In the process of migrating incremental data from upstream to downstream databases, there is a small probability that data flow might cause errors or data loss. In scenarios where strong data consistency is required, such as credit and securities businesses, you can perform a full volume checksum on the data after migration to ensure data consistency. However, in some incremental replication scenarios, upstream and downstream writes are continuous and uninterrupted because the upstream and downstream data is constantly changing, making it difficult to perform consistency checks on all data.
Previously, you needed to interrupt the business to validate the full data, which would affect your business. Now, with this feature, you can perform incremental data validation without interrupting the business.
For more information, see documentation.
TiDB data share subscription
TiCDC supports replicating changed logs to storage sinks (experimental) #6797 @zhaoxinyu
TiCDC supports replicating changed logs to Amazon S3, Azure Blob Storage, NFS, and other S3-compatible storage services. Cloud storage is reasonably priced and easy to use. If you are not using Kafka, you can use storage sinks. TiCDC saves the changed logs to a file and then sends it to the storage system. From the storage system, the consumer program reads the newly generated changed log files periodically.
The storage sink supports changed logs in the canal-json and CSV formats. For more information, see documentation.
TiCDC supports bidirectional replication between two clusters #38587 @xiongjiwei @asddongmen
TiCDC supports bidirectional replication between two TiDB clusters. If you need to build geo-distributed and multiple active data centers for your application, you can use this feature as a solution. By configuring the
bdr-mode = true
parameter for the TiCDC changefeeds from one TiDB cluster to another TiDB cluster, you can achieve bidirectional data replication between the two TiDB clusters.For more information, see documentation.
TiCDC supports updating TLS online #7908 @CharlesCheung96
To keep security of the database system, you need to set an expiration policy for the certificate used by the system. After the expiration period, the system needs a new certificate. TiCDC v6.5.0 supports online updates of TLS certificates. Without interrupting the replication tasks, TiCDC can automatically detect and update the certificate, without the need for manual intervention.
TiCDC performance improves significantly #7540 #7478 #7532 @sdojjy @3AceShowHand
In a test scenario of the TiDB cluster, the performance of TiCDC has improved significantly. Specifically, in the scenario of replicating data to Kafka, the maximum row changes that a single TiCDC can process reaches 30K rows/s, and the replication latency is reduced to 10s. Even during TiKV and TiCDC rolling upgrade, the replication latency is less than 30s.
In a disaster recovery (DR) scenario, if TiCDC redo log and Syncpoint are enabled, the TiCDC throughput can be improved from 4000 rows/s to 35000 rows/s, and the replication latency can be limited to 2s.
Backup and restore
TiDB Backup & Restore supports snapshot checkpoint backup #38647 @Leavrth
TiDB snapshot backup supports resuming backup from a checkpoint. When Backup & Restore (BR) encounters a recoverable error, it retries backup. However, BR exits if the retry fails for several times. The checkpoint backup feature allows for longer recoverable failures to be retried, for example, a network failure of tens of minutes.
Note that if you do not recover the system from a failure within one hour after BR exits, the snapshot data to be backed up might be recycled by the GC mechanism, causing the backup to fail. For more information, see documentation.
PITR performance improved remarkably @joccau
In the log restore stage, the restore speed of one TiKV can reach 9 MiB/s, which is 50% faster than before. The restore speed is scalable and the RTO in DR scenarios is reduced greatly. The RPO in DR scenarios can be as short as 5 minutes. In normal cluster operation and maintenance (OM), for example, a rolling upgrade is performed or only one TiKV is down, the RPO can be 5 minutes.
TiKV-BR GA: Supports backing up and restoring RawKV #67 @pingyu @haojinming
TiKV-BR is a backup and restore tool used in TiKV clusters. TiKV and PD can constitute a KV database when used without TiDB, which is called RawKV. TiKV-BR supports data backup and restore for products that use RawKV. TiKV-BR can also upgrade the
api-version
fromAPI V1
toAPI V2
for TiKV cluster.For more information, see documentation.
Compatibility changes
System variables
Variable name | Change type | Description |
---|---|---|
tidb_enable_amend_pessimistic_txn | Deprecated | Starting from v6.5.0, this variable is deprecated, and TiDB uses the Metadata Lock feature by default to avoid the Information schema is changed error. |
tidb_enable_outer_join_reorder | Modified | Changes the default value from OFF to ON after further tests, meaning that the support of Outer Join for the Join Reorder algorithm is enabled by default. |
tidb_cost_model_version | Modified | Changes the default value from 1 to 2 after further tests, meaning that Cost Model Version 2 is used for index selection and operator selection by default. |
tidb_enable_gc_aware_memory_track | Modified | Changes the default value from ON to OFF . Because the GC-aware memory track is found inaccurate in tests and causes too large analyzed memory size tracked, the memory track is disabled. In addition, in Golang 1.19, the memory tracked by the GC-aware memory track does not have much impact on the overall memory. |
tidb_enable_metadata_lock | Modified | Changes the default value from OFF to ON after further tests, meaning that the metadata lock feature is enabled by default. |
tidb_enable_tiflash_read_for_write_stmt | Modified | Takes effect starting from v6.5.0. It controls whether read operations in SQL statements containing INSERT , DELETE , and UPDATE can be pushed down to TiFlash. The default value is OFF . |
tidb_ddl_enable_fast_reorg | Modified | Changes the default value from OFF to ON after further tests, meaning that the acceleration of ADD INDEX and CREATE INDEX is enabled by default. |
tidb_mem_quota_query | Modified | For versions earlier than TiDB v6.5.0, this variable is used to set the threshold value of memory quota for a query. For TiDB v6.5.0 and later versions, to control the memory of DML statements more accurately, this variable is used to set the threshold value of memory quota for a session. |
tidb_replica_read | Modified | Starting from v6.5.0, to optimize load balancing across TiDB nodes, when this variable is set toclosest-adaptive and the estimated result of a read request is greater than or equal to tidb_adaptive_closest_read_threshold , the number of TiDB nodes whose closest-adaptive configuration takes effect is limited in each availability zone, which is always the same as the number of TiDB nodes in the availability zone with the fewest TiDB nodes, and the other TiDB nodes automatically read from the leader replica. |
tidb_server_memory_limit | Modified | Changes the default value from 0 to 80% . As the TiDB global memory control becomes GA, this default value change enables the memory control by default and sets the memory limit for a TiDB instance to 80% of the total memory by default. |
default_password_lifetime | Newly added | Sets the global policy for automatic password expiration to require users to change passwords periodically. The default value 0 indicates that passwords never expire. |
disconnect_on_expired_password | Newly added | Indicates whether TiDB disconnects the client connection when the password is expired. This variable is read-only. |
password_history | Newly added | This variable is used to establish a password reuse policy that allows TiDB to limit password reuse based on the number of password changes. The default value 0 means disabling the password reuse policy based on the number of password changes. |
password_reuse_interval | Newly added | This variable is used to establish a password reuse policy that allows TiDB to limit password reuse based on time elapsed. The default value 0 means disabling the password reuse policy based on time elapsed. |
tidb_auto_build_stats_concurrency | Newly added | This variable is used to set the concurrency of executing the automatic update of statistics. The default value is 1 . |
tidb_cdc_write_source | Newly added | When this variable is set to a value other than 0, data written in this session is considered to be written by TiCDC. This variable can only be modified by TiCDC. Do not manually modify this variable in any case. |
tidb_index_merge_intersection_concurrency | Newly added | Sets the maximum concurrency for the intersection operations that index merge performs. It is effective only when TiDB accesses partitioned tables in the dynamic pruning mode. |
tidb_source_id | Newly added | This variable is used to configure the different cluster IDs in a bi-directional replication cluster. |
tidb_sysproc_scan_concurrency | Newly added | This variable is used to set the concurrency of scan operations performed when TiDB executes internal SQL statements (such as an automatic update of statistics). The default value is 1 . |
tidb_ttl_delete_batch_size | Newly added | This variable is used to set the maximum number of rows that can be deleted in a single DELETE transaction in a TTL job. |
tidb_ttl_delete_rate_limit | Newly added | This variable is used to limit the maximum number of DELETE statements allowed per second in a single node in a TTL job. When this variable is set to 0 , no limit is applied. |
tidb_ttl_delete_worker_count | Newly added | This variable is used to set the maximum concurrency of TTL jobs on each TiDB node. |
tidb_ttl_job_enable | Newly added | This variable is used to control whether to enable TTL jobs. If it is set to OFF , all tables with TTL attributes automatically stop cleaning up expired data. |
tidb_ttl_job_run_interval | Newly added | This variable is used to control the scheduling interval of the TTL job in the background. For example, if the current value is set to 1h0m0s , each table with TTL attributes will clean up expired data once every hour. |
tidb_ttl_job_schedule_window_start_time | Newly added | This variable is used to control the start time of the scheduling window of the TTL job in the background. When you modify the value of this variable, be cautious that a small window might cause the cleanup of expired data to fail. |
tidb_ttl_job_schedule_window_end_time | Newly added | This variable is used to control the end time of the scheduling window of the TTL job in the background. When you modify the value of this variable, be cautious that a small window might cause the cleanup of expired data to fail. |
tidb_ttl_scan_batch_size | Newly added | This variable is used to set the LIMIT value of each SELECT statement used to scan expired data in a TTL job. |
tidb_ttl_scan_worker_count | Newly added | This variable is used to set the maximum concurrency of TTL scan jobs on each TiDB node. |
validate_password.check_user_name | Newly added | A check item in the password complexity check. It checks whether the password matches the username. This variable takes effect only when validate_password.enable is enabled. The default value is ON . |
validate_password.dictionary | Newly added | A check item in the password complexity check. It checks whether the password matches any word in the dictionary. This variable takes effect only when validate_password.enable is enabled and validate_password.policy is set to 2 (STRONG). The default value is "" . |
validate_password.enable | Newly added | This variable controls whether to perform password complexity check. If this variable is set to ON , TiDB performs the password complexity check when you set a password. The default value is OFF . |
validate_password.length | Newly added | A check item in the password complexity check. It checks whether the password length is sufficient. By default, the minimum password length is 8 . This variable takes effect only when validate_password.enable is enabled. |
validate_password.mixed_case_count | Newly added | A check item in the password complexity check. It checks whether the password contains sufficient uppercase and lowercase letters. This variable takes effect only when validate_password.enable is enabled and validate_password.policy is set to 1 (MEDIUM) or larger. The default value is 1 . |
validate_password.number_count | Newly added | A check item in the password complexity check. It checks whether the password contains sufficient numbers. This variable takes effect only when validate_password.enable is enabled and validate_password.policy is set to 1 (MEDIUM) or larger. The default value is 1 . |
validate_password.policy | Newly added | This variable controls the policy for the password complexity check. The value can be 0 , 1 , or 2 (corresponds to LOW, MEDIUM, or STRONG). This variable takes effect only when validate_password.enable is enabled. The default value is 1 . |
validate_password.special_char_count | Newly added | A check item in the password complexity check. It checks whether the password contains sufficient special characters. This variable takes effect only when validate_password.enable is enabled and validate_password.policy is set to 1 (MEDIUM) or larger. The default value is 1 . |
Configuration file parameters
Configuration file | Configuration parameter | Change type | Description |
---|---|---|---|
TiDB | server-memory-quota | Deprecated | Starting from v6.5.0, this configuration item is deprecated. Instead, use the system variable tidb_server_memory_limit to manage memory globally. |
TiDB | disconnect-on-expired-password | Newly added | Determines whether TiDB disconnects the client connection when the password is expired. The default value is true , which means the client connection is disconnected when the password is expired. |
TiKV | raw-min-ts-outlier-threshold | Deleted | This configuration item was deprecated in v6.4.0 and deleted in v6.5.0. |
TiKV | cdc.min-ts-interval | Modified | To reduce CDC latency, the default value is changed from 1s to 200ms . |
TiKV | memory-use-ratio | Newly added | Indicates the ratio of available memory to total system memory in PITR log recovery. |
TiCDC | sink.terminator | Newly added | Indicates the row terminator, which is used for separating two data change events. The value is empty by default, which means \r\n is used. |
TiCDC | sink.date-separator | Newly added | Indicates the date separator type of the file directory. Value options are none , year , month , and day . none is the default value and means that the date is not separated. |
TiCDC | sink.enable-partition-separator | Newly added | Specifies whether to use partitions as the separation string. The default value is false , which means that partitions in a table are not stored in separate directories. |
TiCDC | sink.csv.delimiter | Newly added | Indicates the delimiter between fields. The value must be an ASCII character and defaults to , . |
TiCDC | sink.csv.quote | Newly added | The quotation that surrounds the fields. The default value is " . When the value is empty, no quotation is used. |
TiCDC | sink.csv.null | Newly added | Specifies the character displayed when the CSV column is null. The default value is \N . |
TiCDC | sink.csv.include-commit-ts | Newly added | Specifies whether to include commit-ts in CSV rows. The default value is false . |
Others
- Starting from v6.5.0, the
mysql.user
table adds two new columns:Password_reuse_history
andPassword_reuse_time
. - Starting from v6.5.0, the index acceleration feature is enabled by default. This feature is not fully compatible with altering multiple columns or indexes in a single
ALTER TABLE
statement. When adding a unique index with the index acceleration, you need to avoid altering other columns or indexes in the same statement. This feature is incompatible with PITR (Point-in-time recovery) either. When using the index acceleration feature, you need to make sure that no PITR backup task is running in the background; otherwise, unexpected results might occur. For more information, see documentation.
Deprecated feature
Starting from v6.5.0, the AMEND TRANSACTION
mechanism introduced in v4.0.7 is deprecated and replaced by Metadata Lock.
Improvements
TiDB
- For
BIT
andCHAR
columns, make the result ofINFORMATION_SCHEMA.COLUMNS
consistent with MySQL #25472 @hawkingrei - Optimize the TiDB probing mechanism for TiFlash nodes in the TiFlash MPP mode to mitigate the performance impact when nodes are abnormal #39686 @hackersean
- For
TiKV
- Stop writing to Raft Engine when there is insufficient space to avoid exhausting disk space #13642 @jiayang-zheng
- Support pushing down the
json_valid
function to TiKV #13571 @lizhenhuan - Support backing up multiple ranges of data in a single backup request #13701 @Leavrth
- Support backing up data to the Asia Pacific region (ap-southeast-3) of AWS by updating the rusoto library #13751 @3pointer
- Reduce pessimistic transaction conflicts #13298 @MyonKeminta
- Improve recovery performance by caching external storage objects #13798 @YuJuncen
- Run the CheckLeader in a dedicated thread to reduce TiCDC replication latency #13774 @overvenus
- Support pull model for Checkpoints #13824 @YuJuncen
- Avoid spinning issues on the sender side by updating crossbeam-channel #13815 @sticnarf
- Support batch Coprocessor tasks processing in TiKV #13849 @cfzjywxk
- Reduce waiting time on failure recovery by notifying TiKV to wake up Regions #13648 @LykxSassinator
- Reduce the requested size of memory usage by code optimization #13827 @BusyJay
- Introduce the Raft extension to improve code extensibility #13827 @BusyJay
- Support using tikv-ctl to query which Regions are included in a certain key range #13760 @HuSharp
- Improve read and write performance for rows that are not updated but locked continuously #13694 @sticnarf
PD
- Optimize the granularity of locks to reduce lock contention and improve the handling capability of heartbeats under high concurrency #5586 @rleungx
- Optimize scheduler performance for large-scale clusters and accelerate the production of scheduling policies #5473 @bufferflies
- Improve the speed of loading Regions #5606 @rleungx
- Reduce unnecessary overhead by optimized handling of Region heartbeats #5648 @rleungx
- Add the feature of automatically garbage collecting tombstone stores #5348 @nolouch
TiFlash
- Improve write performance in scenarios where there is no batch processing on the SQL side #6404 @lidezhu
- Add more details for TableFullScan in the
explain analyze
output #5926 @hongyunyan
Tools
TiDB Dashboard
Backup & Restore (BR)
TiCDC
- Improve the performance of Kafka protocol encoder #7540 #7532 #7543 @3AceShowHand @sdojjy
TiDB Data Migration (DM)
- Improve the data replication performance for DM by not parsing the data of tables in the block list #7622 @GMHDBJD
- Improve the write efficiency of DM relay by using asynchronous write and batch write #7580 @GMHDBJD
- Optimize the error messages in DM precheck #7621 @buchuitoudegou
- Improve the compatibility of
SHOW SLAVE HOSTS
for old MySQL versions #5017 @lyzx2001
Bug fixes
TiDB
- Fix the issue of memory chunk misuse for the chunk reuse feature that occurs in some cases #38917 @keeplearning20221
- Fix the issue that the internal sessions of
tidb_constraint_check_in_place_pessimistic
might be affected by the global setting #38766 @ekexium - Fix the issue that the
AUTO_INCREMENT
column cannot work with theCHECK
constraint #38894 @YangKeao - Fix the issue that using
INSERT IGNORE INTO
to insert data of theSTRING
type into an auto-increment column of theSMALLINT
type will cause an error #38483 @hawkingrei - Fix the issue that the null pointer error occurs in the operation of renaming the partition column of a partitioned table #38932 @mjonss
- Fix the issue that modifying the partition column of a partitioned table causes DDL to hang #38530 @mjonss
- Fix the issue that the
ADMIN SHOW JOB
operation panics after upgrading from v4.0.16 to v6.4.0 #38980 @tangenta - Fix the issue that the
tidb_decode_key
function fails to correctly parse the encoding of partitioned tables #39304 @Defined2014 - Fix the issue that gRPC error logs are not redirected to the correct log file during log rotation #38941 @xhebox
- Fix the issue that TiDB generates an unexpected execution plan for the
BEGIN; SELECT... FOR UPDATE;
point query when TiKV is not configured as a read engine #39344 @Yisaer - Fix the issue that mistakenly pushing down
StreamAgg
to TiFlash causes wrong results #39266 @fixdb
TiKV
- Fix an error in Raft Engine ctl #11119 @tabokie
- Fix the
Get raft db is not allowed
error when executing thecompact raft
command in tikv-ctl #13515 @guoxiangCN - Fix the issue that log backup does not work when TLS is enabled #13867 @YuJuncen
- Fix the support issue of the Geometry field type #13651 @dveeden
- Fix the issue that
_
in theLIKE
operator cannot match non-ASCII characters when new collation is not enabled #13769 @YangKeao - Fix the issue that tikv-ctl is terminated unexpectedly when executing the
reset-to-version
command #13829 @tabokie
PD
TiFlash
- Fix the issue that column files in the delta layer cannot be compacted after restarting TiFlash #6159 @lidezhu
- Fix the issue that TiFlash File Open OPS is too high #6345 @JaySon-Huang
Tools
Backup & Restore (BR)
- Fix the issue that when BR deletes log backup data, it mistakenly deletes data that should not be deleted #38939 @Leavrth
- Fix the issue that restore tasks fail when using old framework for collations in databases or tables #39150 @MoCuishle28
- Fix the issue that backup fails because Alibaba Cloud and Huawei Cloud are not fully compatible with Amazon S3 storage #39545 @3pointer
TiCDC
- Fix the issue that TiCDC gets stuck when the PD leader crashes #7470 @zeminzhou
- Fix data loss occurred in the scenario of executing DDL statements first and then pausing and resuming the changefeed #7682 @asddongmen
- Fix the issue that TiCDC mistakenly reports an error when there is a later version of TiFlash #7744 @overvenus
- Fix the issue that the sink component gets stuck if the downstream network is unavailable #7706 @hicqu
- Fix the issue that data is lost when a user quickly deletes a replication task and then creates another one with the same task name #7657 @overvenus
TiDB Data Migration (DM)
- Fix the issue that a
task-mode:all
task cannot be started when the upstream database enables the GTID mode but does not have any data #7037 @liumengya94 - Fix the issue that data is replicated for multiple times when a new DM worker is scheduled before the existing worker exits #7658 @GMHDBJD
- Fix the issue that DM precheck is not passed when the upstream database uses regular expressions to grant privileges #7645 @lance6716
- Fix the issue that a
TiDB Lightning
Contributors
We would like to thank the following contributors from the TiDB community:
- e1ijah1
- guoxiangCN (First-time contributor)
- jiayang-zheng
- jiyfhust
- mikechengwei
- pingandb
- sashashura
- sourcelliu
- wxbty