1. Benchmark Comparison of Cloud Analytics Methods Applied to Earth Observations
- Author
-
Lynnes, Christopher, Little, Michael M, Huang, Thomas, Jacob, Joseph Charles, Yang, Chaowei Phil, Hegde, Mahabaleshwara, and Zhang, Hailiang
- Subjects
Computer Systems - Abstract
Earth Observation data are a vital resource for studying long term changes, but the large data volumes can be challenging to analyze. Time series analysis in particular is hampered by the typical thin-time-slice file organization. We examine several potential solutions inspired in large part by the data-parallel methods that have arisen with cloud computing. These solutions include various combinations of data re-organization, spatial indexing, distributed storage and pre-computation that we term "Analytics Optimized Data Stores" (AODS). We find that even simple solutions (such as a data cube) produce more than an order of magnitude improvement; the best provide two to three orders of magnitude improvement. The most performant solutions have tradeoffs in terms of generality or storage footprint, but may nonetheless be useful components in data analytics frameworks where performance is critical.
- Published
- 2018