Back to Search Start Over

CrowdChart: Crowdsourced Data Extraction From Visualization Charts.

Authors :
Chai, Chengliang
Li, Guoliang
Fan, Ju
Luo, Yuyu
Source :
IEEE Transactions on Knowledge & Data Engineering; Nov2021, Vol. 33 Issue 11, p3537-3549, 13p
Publication Year :
2021

Abstract

Visualization charts are widely utilized for presenting structured data. Under many circumstances, people want to digitalize the data in the charts collected from various sources (e.g., papers and websites), in oder to further analyze the data or create new charts. However, existing automatic and semi-automatic approaches are not always effective due to the variety of charts. In this paper, we introduce a crowdsourcing approach that leverages human ability to extract data from visualization charts. There are several challenges. The first is how to avoid tedious human interaction with charts and design effective crowdsourcing tasks. Second, it is challenging to evaluate worker’s quality for truth inference, because workers may not only provide inaccurate values but also misalign values to wrong data series. Third, to guarantee quality, one may assign a task to many workers, leading to a high crowdsourcing cost. To address these challenges, we design an effective crowdsourcing task scheme that splits a chart into simple micro-tasks. We introduce a novel worker quality model by considering worker’s accuracy and task difficulty. We also devise effective task assignment and early-termination mechanisms to save the cost. We evaluate our approach on real-world datasets on real crowdsourced platforms, and the results demonstrate the effectiveness of our method. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10414347
Volume :
33
Issue :
11
Database :
Complementary Index
Journal :
IEEE Transactions on Knowledge & Data Engineering
Publication Type :
Academic Journal
Accession number :
153711847
Full Text :
https://doi.org/10.1109/TKDE.2020.2972543