Back to Search Start Over

TiDB

Authors :
Yuxing Zhou
Dongxu Huang
Xuelian Wu
Qi Liu
Menglong Huang
Lingyu Song
Xiaoyu Ma
Liquan Pei
Jian Zhang
Ruoxi Sun
Fei Xu
Wan Wei
Xin Tang
Fang Zhuhe
Shuaipeng Yu
Lei Zhao
Liu Tang
Li Shen
Cui Qiu
Cong Liu
Jianjun Li
Nicholas Cameron
Source :
Proceedings of the VLDB Endowment. 13:3072-3084
Publication Year :
2020
Publisher :
Association for Computing Machinery (ACM), 2020.

Abstract

Hybrid Transactional and Analytical Processing (HTAP) databases require processing transactional and analytical queries in isolation to remove the interference between them. To achieve this, it is necessary to maintain different replicas of data specified for the two types of queries. However, it is challenging to provide a consistent view for distributed replicas within a storage system, where analytical requests can efficiently read consistent and fresh data from transactional workloads at scale and with high availability. To meet this challenge, we propose extending replicated state machine-based consensus algorithms to provide consistent replicas for HTAP workloads. Based on this novel idea, we present a Raft-based HTAP database: TiDB. In the database, we design a multi-Raft storage system which consists of a row store and a column store. The row store is built based on the Raft algorithm. It is scalable to materialize updates from transactional requests with high availability. In particular, it asynchronously replicates Raft logs to learners which transform row format to column format for tuples, forming a real-time updatable column store. This column store allows analytical queries to efficiently read fresh and consistent data with strong isolation from transactions on the row store. Based on this storage system, we build an SQL engine to process large-scale distributed transactions and expensive analytical queries. The SQL engine optimally accesses row-format and column-format replicas of data. We also include a powerful analysis engine, TiSpark, to help TiDB connect to the Hadoop ecosystem. Comprehensive experiments show that TiDB achieves isolated high performance under CH-benCHmark, a benchmark focusing on HTAP workloads.

Details

ISSN :
21508097
Volume :
13
Database :
OpenAIRE
Journal :
Proceedings of the VLDB Endowment
Accession number :
edsair.doi...........eee76310ce7a52fc895b55d728d6ad58