Back to Search Start Over

Karst: Transactional Data Ingestion Without Blocking on a Scalable Architecture.

Authors :
Li, Zhifang
Peng, Beicheng
Huang, Qiuli
Weng, Chuliang
Source :
IEEE Transactions on Knowledge & Data Engineering. May2022, Vol. 34 Issue 5, p2241-2253. 13p.
Publication Year :
2022

Abstract

Although real-time analytics on the up-to-date dataset has become an emerging demand, many big data systems are still designed for offline analytics. Particularly, for critical applications like Fintech, transactional data ingestion ensures a timely, always-correct, and scalable dataset. To carry out append-only ingestion, existing OLTP/HTAP systems are based on strict transactions with imperfect scalability, while NoSQL-like systems support scalable but relaxed transactions. How to ensure essential transactional guarantees without harming scalability seems to be a non-trivial issue. This paper proposes Karst to bring transactional data ingestion for existing offline analytics. We notice that blocking two-phase commit (2PC) to resolve transactional data ingestion is a performance killer for the partitioned analytical systems. Karst introduces a scalable protocol called metadata-oriented commit (MOC) that converts each distributed transaction into multiple partial transactions to avoid 2PC. Moreover, to ingest massive data into plenty of partitions, Karst also employs lazy persistence, lightweight logging, and optimized data traffic. In experiments, Karst could achieve up to about 2x $\sim$ ∼ 10x performance over relevant systems and also shows remarkable scalability. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10414347
Volume :
34
Issue :
5
Database :
Academic Search Index
Journal :
IEEE Transactions on Knowledge & Data Engineering
Publication Type :
Academic Journal
Accession number :
156273270
Full Text :
https://doi.org/10.1109/TKDE.2020.3011510