12 results on '"BINGSHENG HE"'
Search Results
2. GraphTune: An Efficient Dependency-Aware Substrate to Alleviate Irregularity in Concurrent Graph Processing.
- Author
-
JIN ZHAO, YU ZHANG, LIGANG HE, QIKUN LI, XIANG ZHANG, XINYU JIANG, HUI YU, XIAOFEI LIAO, HAI JIN, LIN GU, HAIKUN LIU, BINGSHENG HE, JI ZHANG, XIANZHENG SONG, LIN WANG, and JUN ZHOU
- Published
- 2023
- Full Text
- View/download PDF
3. The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems.
- Author
-
SIXU HU, YUAN LI, XU LIU, QINBIN LI, ZHAOMIN WU, and BINGSHENG HE
- Subjects
INSTRUCTIONAL systems ,MACHINE learning - Abstract
This article presents and characterizes an Open Application Repository for Federated Learning (OARF), a benchmark suite for federated machine learning systems. Previously available benchmarks for federated learning (FL) have focused mainly on synthetic datasets and use a limited number of applications. OARF mimics more realistic application scenarios with publicly available datasets as different data silos in image, text, and structured data. Our characterization shows that the benchmark suite is diverse in data size, distribution, feature distribution, and learning task complexity. The extensive evaluations with reference implementations show the future research opportunities for important aspects of FL systems. We have developed reference implementations, and evaluated the important aspects of FL, including model accuracy, communication cost, throughput, and convergence time. Through these evaluations, we discovered some interesting findings such as FL can effectively increase end-to-end throughput. The code of OARF is publicly available on GitHub.
1 [ABSTRACT FROM AUTHOR]- Published
- 2022
- Full Text
- View/download PDF
4. ByteSeries:An In-Memory Time Series Database for Large-Scale Monitoring Systems
- Author
-
Yan Jiang, Xuanhua Shi, Hai Jin, Xin Li, Yongluan Zhou, Zezhao Feng, Bingsheng He, Kaixi Li, and Zhijun Ling
- Subjects
Computer science ,Real-time computing ,02 engineering and technology ,Inverted index ,Metadata ,020204 information systems ,Metadata management ,0202 electrical engineering, electronic engineering, information engineering ,Memory footprint ,Overhead (computing) ,020201 artificial intelligence & image processing ,Time series ,Throughput (business) ,Time series database - Abstract
Monitoring large-scale and complex systems often generates high-dimensional and highly dynamic time series data. In such a scenario, massive metadata has to be maintained to support efficient querying, whose large footprint poses great challenges to in-memory databases. In this paper, we present ByteSeries, an in-memory time series database that is designed specifically for large-scale monitoring systems to manage high-dimensional time series. We start with an analysis of the production data and workload at ByteDance's metric monitoring system, which contains over 10 billion time series dimensions. The observation of high overhead of metadata management in high-dimensional time series data calls for a rethink of time series database systems. Byte-Series's memory structure employs the novel Compressed Inverted Index to effectively compress metadata while maintaining high efficiency for multi-dimensional queries. In addition, an algorithm is proposed to effectively convert data into compressed form without sacrificing the data ingestion throughput. We experimentally evaluate ByteSeries by comparing it with ByteDance's original production system, tsdc, as well as two open-source systems, namely Gorilla and Prometheus. We show that ByteSeries significantly improves over ByteDance's original production system by 1) reducing the memory footprint of metadata by 60% and the whole memory consumption by 50%, and 2) speeding up multi-dimensional queries by 1.8x-10.7x.
- Published
- 2020
- Full Text
- View/download PDF
5. Welcome.
- Author
-
Salim, Flora, Bingsheng He, and Ken-ichi Kawarabayashi
- Subjects
- *
ARTIFICIAL intelligence , *BLOCKCHAINS , *BIG data , *MACHINE learning , *EDUCATIONAL technology - Abstract
An introduction is presented by the special section co-organizers of the editorial board discussing the theme of this special issue of ‘Communications of the ACM’ which focuses on East Asia and Oceania's contributions to research and innovation in the computer science and technology industry. Topics in this special issue include, but are not limited, to artificial intelligence and machine learning, big data, and learning analytics.
- Published
- 2023
- Full Text
- View/download PDF
6. The Serverless Computing Survey: A Technical Primer for Design Architecture.
- Author
-
ZIJUN LI, LINSONG GUO, JIAGAN CHENG, QUAN CHEN, BINGSHENG HE, and MINYI GUO
- Subjects
ARCHITECTURAL design ,COMMUNICATION infrastructure ,COMMUNITIES ,SCALABILITY - Abstract
The development of cloud infrastructures inspires the emergence of cloud-native computing. As the most promising architecture for deploying microservices, serverless computing has recently attracted more and more attention in both industry and academia. Due to its inherent scalability and flexibility, serverless computing becomes attractive and more pervasive for ever-growing Internet services. Despite the momentum in the cloud-native community, the existing challenges and compromises still wait for more advanced research and solutions to further explore the potential of the serverless computing model. As a contribution to this knowledge, this article surveys and elaborates the research domains in the serverless context by decoupling the architecture into four stack layers: Virtualization, Encapsule, System Orchestration, and System Coordination. Inspired by the security model, we highlight the key implications and limitations of these works in each layer, and make suggestions for potential challenges to the field of future serverless computing. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
7. CGraph: A Distributed Storage and Processing System for Concurrent Iterative Graph Analysis Jobs.
- Author
-
YU ZHANG, JIN ZHAO, XIAOFEI LIAO, HAI JIN, LIN GU, HAIKUN LIU, BINGSHENG HE, and LIGANG HE
- Subjects
JOB analysis ,DISTRIBUTED computing ,DISTRIBUTED algorithms ,CACHE memory ,DATA structures - Abstract
Distributed graph processing platforms usually need to handle massive Concurrent iterative Graph Processing (CGP) jobs for different purposes. However, existing distributed systems face high ratio of data access cost to computation for the CGP jobs, which incurs low throughput. We observed that there are strong spatial and temporal correlations among the data accesses issued by different CGP jobs, because these concurrently running jobs usually need to repeatedly traverse the shared graph structure for the iterative processing of each vertex. Based on this observation, this article proposes a distributed storage and processing system CGraph for the CGP jobs to efficiently handle the underlying static/evolving graph for high throughput. It uses a data-centric load-trigger-pushing model, together with several optimizations, to enable the CGP jobs to efficiently share the graph structure data in the cache/memory and their accesses by fully exploiting such correlations, where the graph structure data is decoupled from the vertex state associated with each job. It can deliver much higher throughput for the CGP jobs by effectively reducing their average ratio of data access cost to computation. Experimental results show that CGraph improves the throughput of the CGP jobs by up to 3.47× in comparison with existing solutions on distributed platforms. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
8. RAMZzz.
- Author
-
Donghong Wu, Bingsheng He, Xueyan Tang, Jianliang Xu, and Minyi Guo
- Published
- 2012
9. Sensor Placement and Measurement of Wind for Water Quality Studies in Urban Reservoirs.
- Author
-
WAN DU, ZIKUN XING, MO LI, BINGSHENG HE, CHYE CHUA, LLOYD HOCK, and HAIYAN MIAO
- Subjects
SENSOR placement ,WIND measurement ,WATER quality monitoring ,RESERVOIRS ,METROPOLITAN areas ,COMPUTATIONAL fluid dynamics - Abstract
We study the water quality in an urban district, where the surface wind distribution is an essential input but undergoes high spatial and temporal variations due to the impact of surrounding buildings. In this work, we develop an optimal sensor placement scheme to measure the wind distribution over a large urban reservoir using a limited number of wind sensors. Unlike existing solutions that assume Gaussian process of target phenomena, this study measures the wind that inherently exhibits strong non-Gaussian yearly distribution. By leveraging the local monsoon characteristics of wind, we segment a year into different monsoon seasons that follow a unique distribution respectively. We also use computational fluid dynamics to learn the spatial correlation of wind. The output of sensor placement is a set of the most informative locations to deploy the wind sensors, based on the readings of which we can accurately predict the wind over the entire reservoir in real time. Ten wind sensors are deployed. The in-field measurement results of more than 3 months suggest that the proposed sensor placement and spatial prediction scheme provides accurate wind measurement that outperforms the state-of-the-art Gaussian model based on interpolation-based approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
10. Relational Query Coprocessing on Graphics Processors.
- Author
-
BINGSHENG HE, MIAN LU, KE YANG, RUI FANG, GOVINDARAJU, NAGA K., QIONG LUO, and SANDER, PEDRO V.
- Subjects
- *
GRAPHICS processing units , *COPROCESSORS , *RELATIONAL databases , *QUERYING (Computer science) , *ALGORITHMS , *PARALLEL processing - Abstract
Graphics processors (GPUs) have recently emerged as powerful coprocessors for general purpose computation. Compared with commodity CPUs, GPUs have an order of magnitude higher computation power as well as memory bandwidth. Moreover, new-generation GPUs allow writes to random memory locations, provide efficient interprocessor communication through on-chip local memory, and support a general purpose parallel programming model. Nevertheless, many of the GPU features are specialized for graphics processing, including the massively multithreaded architecture, the Single-Instruction-Multiple-Data processing style, and the execution model of a single application at a time. Additionally, GPUs rely on a bus of limited bandwidth to transfer data to and from the CPU, do not allow dynamic memory allocation from GPU kernels, and have little hardware support for write conflicts. Therefore, a careful design and implementation is required to utilize the GPU for coprocessing database queries. In this article, we present our design, implementation, and evaluation of an in-memory relational query coprocessing system, GDB, on the GPU. Taking advantage of the GPU hardware features, we design a set of highly optimized data-parallel primitives such as split and sort, and use these primitives to implement common relational query processing algorithms. Our algorithms utilize the high parallelism as well as the high memory bandwidth of the GPU, and use parallel computation and memory optimizations to effectively reduce memory stalls. Furthermore, we propose coprocessing techniques that take into account both the computation resources and the GPU-CPU data transfer cost so that each operator in a query can utilize suitable processors—the CPU, the GPU, or both—for an optimized overall performance. We have evaluated our GDB system on a machine with an Intel quad-core CPU and an NVIDIA GeForce 8800 GTX GPU. Our workloads include microbenchmark queries on memory-resident data as well as TPC-H queries that involve complex data types and multiple query operators on data sets larger than the GPU memory. Our results show that our GPU-based algorithms are 2-27× faster than their optimized CPU-based counterparts on in-memory data. Moreover, the performance of our coprocessing scheme is similar to, or better than, both the GPU-only and the CPU-only schemes. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
11. Cache-Oblivious Databases: Limitations and Opportunities.
- Author
-
Bingsheng He and Qiong Luo
- Subjects
- *
DATABASES , *CACHE memory , *MEMORY hierarchy (Computer science) , *ALGORITHMS , *COST control , *MEASUREMENT - Abstract
Cache-oblivious techniques, proposed in the theory community, have optimal asymptotic bounds on the amount of data transferred between any two adjacent levels of an arbitrary memory hierarchy. Moreover, this optimal performance is achieved without any hardware platform specific tuning. These properties are highly attractive to autonomous databases, especially because the hardware architectures are becoming increasingly complex and diverse. In this article,we present our design, implementation, and evaluation of the first cache-oblivious in-memory query processor, EaseDB. Moreover, we discuss the inherent limitations of the cacheoblivious approach as well as the opportunities given by the upcoming hardware architectures. Specifically, a cache-oblivious technique usually requires sophisticated algorithm design to achieve a comparable performance to its cache-conscious counterpart. Nevertheless, this development-time effort is compensated by the automaticity of performance achievement and the reduced ownership cost. Furthermore, this automaticity enables cache-oblivious techniques to outperform their cacheconscious counterparts in multi-threading processors. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
12. An overview of CMPI.
- Author
-
Yifan Gong, Bingsheng He, and Jianlong Zhong
- Published
- 2012
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.