
TDSQL: Tencent Distributed Database System
Yuxing Chen
Anqun Pan*
Hailin Lei
Tencent Inc.
{axingguchen,aaronpan,
harlylei}@tencent.com
Anda Ye
Shuo Han
Yan Tang
Tencent Inc.
{andaye,shuohan,
allenytang}@tencent.com
Wei Lu
Yunpeng Chai
Renmin University of China
lu-wei@ruc.edu.cn
ypchai@ruc.edu.cn
Feng Zhang
Xiaoyong Du
Renmin University of China
fengzhang@ruc.edu.cn
duyong@ruc.edu.cn
ABSTRACT
Distributed databases have become indispensable in contemporary
computing and data processing, owing to their pivotal role in en-
suring high availability and scalability. They eectively cater to the
requirements of data management and high-concurrency access.
However, developing a distributed database system that is well-
suited for diverse application scenarios, particularly for large-scale
applications, presents several challenges. These challenges include
ensuring data consistency and achieving high levels of performance.
This paper presents TDSQL, a distributed database system that
prioritizes core design principles of distributed systems, including
high availability, strong consistency, and scalability. In particular,
TDSQL has achieved high performance through over a decade of
practical experience and optimization in various modules, such
as the kernel, synchronous replication, and transaction process-
ing, in large-scale application scenarios. By conducting the TPC-C
benchmark test, TDSQL demonstrated outstanding performance,
achieving a throughput of 814 million tpmC across 1650 database
nodes, with a jitter rate of less than 0.2%. This jitter rate is an order
of magnitude lower than the standard required, showcasing the
system’s stability and reliability. During the 8-hour TPC-C standard
stress test, TDSQL successfully completed over 860 billion trans-
actions and processed 40 trillion order details, with zero forced
rollbacks and zero data inconsistency.
PVLDB Reference Format:
Yuxing Chen, Anqun Pan, Hailin Lei, Anda Ye, Shuo Han, Yan Tang, Wei
Lu, Yunpeng Chai, Feng Zhang, and Xiaoyong Du. TDSQL: Tencent
Distributed Database System. PVLDB, 17(12): 3869 - 3882, 2024.
doi:10.14778/3685800.3685812
1 INTRODUCTION
In recent years, the exponential growth in data volume and com-
plexity has led to performance challenges for traditional centralized
databases. Distributed database systems have emerged as a promis-
ing solution, oering high scalability [
12
,
15
,
26
,
55
], availability
[
9
,
60
,
61
], and performance [
18
,
32
,
43
,
50
,
71
,
80
]. However, in
distributed scenarios, new challenges arise, such as the trade-o
between performance [
46
,
53
] and consistency [
8
,
17
,
54
,
64
,
70
].
This work is licensed under the Creative Commons BY-NC-ND 4.0 International
License. Visit https://creativecommons.org/licenses/by-nc-nd/4.0/ to view a copy of
this license. For any use beyond those covered by this license, obtain permission by
emailing info@vldb.org. Copyright is held by the owner/author(s). Publication rights
licensed to the VLDB Endowment.
Proceedings of the VLDB Endowment, Vol. 17, No. 12 ISSN 2150-8097.
doi:10.14778/3685800.3685812
*Anqun Pan is the corresponding author.
T
encent
D
istributed
SQL
(TDSQL), developed by Tencent Cloud
[
13
], is a database system specically designed to address the per-
formance requirements of large-scale applications, e.g., e-commerce
and banking scenarios, while also ensuring consistency, including
strong synchronization [22, 44, 53, 59].
TDSQL is specically designed to deliver high-performance and
reliable databases for enterprises of all sizes. TDSQL has undergone
iterative developments and meticulous optimizations to enhance
its distributed functionalities, particularly focusing on improving
distributed transaction processing capabilities. A noteworthy char-
acteristic of TDSQL is its share-nothing architecture, which facil-
itates horizontal scaling across multiple nodes. This architecture
empowers TDSQL to eectively handle substantial data volumes
and manage high concurrency, with performance scaling that ap-
proaches linearity. Our TPC-C benchmark test has demonstrated
the capability of TDSQL in eciently processing data exceeding
the 10 PB threshold while maintaining scalability on a single cluster
equipped with over 100,000 physical cores. As an increasing num-
ber of nancial industry enterprises, such as banks and securities
rms, adopt TDSQL, it also oers a range of advanced features to
ensure high availability. These features encompass auto-failover,
data replication, primary-secondary switching, and recovery [79].
TDSQL was ocially launched on Tencent Cloud [
13
] and has
gained widespread adoption across 30,000 enterprises in various in-
dustries, including e-commerce, nance, government, and telecom-
munications. As a result, it has emerged as the market leader in
China’s distributed relational database market [
30
]. Signicantly,
TDSQL holds the distinction of being the rst domestically de-
veloped database in China to be utilized in both internet-based
distributed banking core systems and traditional banking core sys-
tems. It has also played a pioneering role in assisting domestic
banks with migrating their core systems from centralized to dis-
tributed architectures. Currently, 7 out of the top 10 banks in China
have already adopted TDSQL for services such as deposits, loans,
payments, general ledger, and common operations.
This paper shares our experiences in designing, developing, and
optimizing the TDSQL, a large-scale distributed database system.
We conducted the ocial TPC-C benchmark test [
58
] on TDSQL.
The results were impressive compared to open reports [
57
], as TD-
SQL achieved a remarkable performance of 814 million tpmC across
1650 database nodes (surpassing the second place by 15% overall
and 8% per node) with a jitter rate less than 0.2%, which is an order
of magnitude lower than the standard required. Throughout the
8-hour stress test, TDSQL demonstrated exceptional performance
by handling a staggering volume of over 860 billion transactions
and processing an astonishing 40 trillion order details, all without
3869
文档被以下合辑收录
评论