Author: "BINGSHENG HE" / Publisher: association for computing machinery - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"BINGSHENG HE"' showing total 12 results

Start Over Author "BINGSHENG HE" Publisher association for computing machinery

12 results on '"BINGSHENG HE"'

1. RACE: An Efficient Redundancy-aware Accelerator for Dynamic Graph Neural Network.

Author: HUI YU, YU ZHANG, JIN ZHAO, YUJIAN LIAO, ZHIYING HUANG, DONGHAO HE, LIN GU, HAI JIN, XIAOFEI LIAO, HAIKUN LIU, BINGSHENG HE, and JIANHUI YUE
Published: 2023
Full Text: View/download PDF

2. GraphTune: An Efficient Dependency-Aware Substrate to Alleviate Irregularity in Concurrent Graph Processing.

Author: JIN ZHAO, YU ZHANG, LIGANG HE, QIKUN LI, XIANG ZHANG, XINYU JIANG, HUI YU, XIAOFEI LIAO, HAI JIN, LIN GU, HAIKUN LIU, BINGSHENG HE, JI ZHANG, XIANZHENG SONG, LIN WANG, and JUN ZHOU
Published: 2023
Full Text: View/download PDF

3. The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems.

Author: SIXU HU, YUAN LI, XU LIU, QINBIN LI, ZHAOMIN WU, and BINGSHENG HE
Subjects: INSTRUCTIONAL systems, MACHINE learning
Abstract: This article presents and characterizes an Open Application Repository for Federated Learning (OARF), a benchmark suite for federated machine learning systems. Previously available benchmarks for federated learning (FL) have focused mainly on synthetic datasets and use a limited number of applications. OARF mimics more realistic application scenarios with publicly available datasets as different data silos in image, text, and structured data. Our characterization shows that the benchmark suite is diverse in data size, distribution, feature distribution, and learning task complexity. The extensive evaluations with reference implementations show the future research opportunities for important aspects of FL systems. We have developed reference implementations, and evaluated the important aspects of FL, including model accuracy, communication cost, throughput, and convergence time. Through these evaluations, we discovered some interesting findings such as FL can effectively increase end-to-end throughput. The code of OARF is publicly available on GitHub.1 [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

4. ByteSeries:An In-Memory Time Series Database for Large-Scale Monitoring Systems

Author: Yan Jiang, Xuanhua Shi, Hai Jin, Xin Li, Yongluan Zhou, Zezhao Feng, Bingsheng He, Kaixi Li, and Zhijun Ling
Subjects: Computer science, Real-time computing, 02 engineering and technology, Inverted index, Metadata, 020204 information systems, Metadata management, 0202 electrical engineering, electronic engineering, information engineering, Memory footprint, Overhead (computing), 020201 artificial intelligence & image processing, Time series, Throughput (business), Time series database
Abstract: Monitoring large-scale and complex systems often generates high-dimensional and highly dynamic time series data. In such a scenario, massive metadata has to be maintained to support efficient querying, whose large footprint poses great challenges to in-memory databases. In this paper, we present ByteSeries, an in-memory time series database that is designed specifically for large-scale monitoring systems to manage high-dimensional time series. We start with an analysis of the production data and workload at ByteDance's metric monitoring system, which contains over 10 billion time series dimensions. The observation of high overhead of metadata management in high-dimensional time series data calls for a rethink of time series database systems. Byte-Series's memory structure employs the novel Compressed Inverted Index to effectively compress metadata while maintaining high efficiency for multi-dimensional queries. In addition, an algorithm is proposed to effectively convert data into compressed form without sacrificing the data ingestion throughput. We experimentally evaluate ByteSeries by comparing it with ByteDance's original production system, tsdc, as well as two open-source systems, namely Gorilla and Prometheus. We show that ByteSeries significantly improves over ByteDance's original production system by 1) reducing the memory footprint of metadata by 60% and the whole memory consumption by 50%, and 2) speeding up multi-dimensional queries by 1.8x-10.7x.
Published: 2020
Full Text: View/download PDF

5. Welcome.

Author: Salim, Flora, Bingsheng He, and Ken-ichi Kawarabayashi
Subjects: *ARTIFICIAL intelligence, *BLOCKCHAINS, *BIG data, *MACHINE learning, *EDUCATIONAL technology
Abstract: An introduction is presented by the special section co-organizers of the editorial board discussing the theme of this special issue of ‘Communications of the ACM’ which focuses on East Asia and Oceania's contributions to research and innovation in the computer science and technology industry. Topics in this special issue include, but are not limited, to artificial intelligence and machine learning, big data, and learning analytics.
Published: 2023
Full Text: View/download PDF

6. The Serverless Computing Survey: A Technical Primer for Design Architecture.

Author: ZIJUN LI, LINSONG GUO, JIAGAN CHENG, QUAN CHEN, BINGSHENG HE, and MINYI GUO
Subjects: ARCHITECTURAL design, COMMUNICATION infrastructure, COMMUNITIES, SCALABILITY
Abstract: The development of cloud infrastructures inspires the emergence of cloud-native computing. As the most promising architecture for deploying microservices, serverless computing has recently attracted more and more attention in both industry and academia. Due to its inherent scalability and flexibility, serverless computing becomes attractive and more pervasive for ever-growing Internet services. Despite the momentum in the cloud-native community, the existing challenges and compromises still wait for more advanced research and solutions to further explore the potential of the serverless computing model. As a contribution to this knowledge, this article surveys and elaborates the research domains in the serverless context by decoupling the architecture into four stack layers: Virtualization, Encapsule, System Orchestration, and System Coordination. Inspired by the security model, we highlight the key implications and limitations of these works in each layer, and make suggestions for potential challenges to the field of future serverless computing. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

7. CGraph: A Distributed Storage and Processing System for Concurrent Iterative Graph Analysis Jobs.

Author: YU ZHANG, JIN ZHAO, XIAOFEI LIAO, HAI JIN, LIN GU, HAIKUN LIU, BINGSHENG HE, and LIGANG HE
Subjects: JOB analysis, DISTRIBUTED computing, DISTRIBUTED algorithms, CACHE memory, DATA structures
Abstract: Distributed graph processing platforms usually need to handle massive Concurrent iterative Graph Processing (CGP) jobs for different purposes. However, existing distributed systems face high ratio of data access cost to computation for the CGP jobs, which incurs low throughput. We observed that there are strong spatial and temporal correlations among the data accesses issued by different CGP jobs, because these concurrently running jobs usually need to repeatedly traverse the shared graph structure for the iterative processing of each vertex. Based on this observation, this article proposes a distributed storage and processing system CGraph for the CGP jobs to efficiently handle the underlying static/evolving graph for high throughput. It uses a data-centric load-trigger-pushing model, together with several optimizations, to enable the CGP jobs to efficiently share the graph structure data in the cache/memory and their accesses by fully exploiting such correlations, where the graph structure data is decoupled from the vertex state associated with each job. It can deliver much higher throughput for the CGP jobs by effectively reducing their average ratio of data access cost to computation. Experimental results show that CGraph improves the throughput of the CGP jobs by up to 3.47× in comparison with existing solutions on distributed platforms. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

8. RAMZzz.

Author: Donghong Wu, Bingsheng He, Xueyan Tang, Jianliang Xu, and Minyi Guo
Published: 2012

9. Sensor Placement and Measurement of Wind for Water Quality Studies in Urban Reservoirs.

Author: WAN DU, ZIKUN XING, MO LI, BINGSHENG HE, CHYE CHUA, LLOYD HOCK, and HAIYAN MIAO
Subjects: SENSOR placement, WIND measurement, WATER quality monitoring, RESERVOIRS, METROPOLITAN areas, COMPUTATIONAL fluid dynamics
Abstract: We study the water quality in an urban district, where the surface wind distribution is an essential input but undergoes high spatial and temporal variations due to the impact of surrounding buildings. In this work, we develop an optimal sensor placement scheme to measure the wind distribution over a large urban reservoir using a limited number of wind sensors. Unlike existing solutions that assume Gaussian process of target phenomena, this study measures the wind that inherently exhibits strong non-Gaussian yearly distribution. By leveraging the local monsoon characteristics of wind, we segment a year into different monsoon seasons that follow a unique distribution respectively. We also use computational fluid dynamics to learn the spatial correlation of wind. The output of sensor placement is a set of the most informative locations to deploy the wind sensors, based on the readings of which we can accurately predict the wind over the entire reservoir in real time. Ten wind sensors are deployed. The in-field measurement results of more than 3 months suggest that the proposed sensor placement and spatial prediction scheme provides accurate wind measurement that outperforms the state-of-the-art Gaussian model based on interpolation-based approaches. [ABSTRACT FROM AUTHOR]
Published: 2015
Full Text: View/download PDF

10. Relational Query Coprocessing on Graphics Processors.

Author: BINGSHENG HE, MIAN LU, KE YANG, RUI FANG, GOVINDARAJU, NAGA K., QIONG LUO, and SANDER, PEDRO V.
Subjects: *GRAPHICS processing units, *COPROCESSORS, *RELATIONAL databases, *QUERYING (Computer science), *ALGORITHMS, *PARALLEL processing
Abstract: Graphics processors (GPUs) have recently emerged as powerful coprocessors for general purpose computation. Compared with commodity CPUs, GPUs have an order of magnitude higher computation power as well as memory bandwidth. Moreover, new-generation GPUs allow writes to random memory locations, provide efficient interprocessor communication through on-chip local memory, and support a general purpose parallel programming model. Nevertheless, many of the GPU features are specialized for graphics processing, including the massively multithreaded architecture, the Single-Instruction-Multiple-Data processing style, and the execution model of a single application at a time. Additionally, GPUs rely on a bus of limited bandwidth to transfer data to and from the CPU, do not allow dynamic memory allocation from GPU kernels, and have little hardware support for write conflicts. Therefore, a careful design and implementation is required to utilize the GPU for coprocessing database queries. In this article, we present our design, implementation, and evaluation of an in-memory relational query coprocessing system, GDB, on the GPU. Taking advantage of the GPU hardware features, we design a set of highly optimized data-parallel primitives such as split and sort, and use these primitives to implement common relational query processing algorithms. Our algorithms utilize the high parallelism as well as the high memory bandwidth of the GPU, and use parallel computation and memory optimizations to effectively reduce memory stalls. Furthermore, we propose coprocessing techniques that take into account both the computation resources and the GPU-CPU data transfer cost so that each operator in a query can utilize suitable processors—the CPU, the GPU, or both—for an optimized overall performance. We have evaluated our GDB system on a machine with an Intel quad-core CPU and an NVIDIA GeForce 8800 GTX GPU. Our workloads include microbenchmark queries on memory-resident data as well as TPC-H queries that involve complex data types and multiple query operators on data sets larger than the GPU memory. Our results show that our GPU-based algorithms are 2-27× faster than their optimized CPU-based counterparts on in-memory data. Moreover, the performance of our coprocessing scheme is similar to, or better than, both the GPU-only and the CPU-only schemes. [ABSTRACT FROM AUTHOR]
Published: 2009
Full Text: View/download PDF

11. Cache-Oblivious Databases: Limitations and Opportunities.

Author: Bingsheng He and Qiong Luo
Subjects: *DATABASES, *CACHE memory, *MEMORY hierarchy (Computer science), *ALGORITHMS, *COST control, *MEASUREMENT
Abstract: Cache-oblivious techniques, proposed in the theory community, have optimal asymptotic bounds on the amount of data transferred between any two adjacent levels of an arbitrary memory hierarchy. Moreover, this optimal performance is achieved without any hardware platform specific tuning. These properties are highly attractive to autonomous databases, especially because the hardware architectures are becoming increasingly complex and diverse. In this article,we present our design, implementation, and evaluation of the first cache-oblivious in-memory query processor, EaseDB. Moreover, we discuss the inherent limitations of the cacheoblivious approach as well as the opportunities given by the upcoming hardware architectures. Specifically, a cache-oblivious technique usually requires sophisticated algorithm design to achieve a comparable performance to its cache-conscious counterpart. Nevertheless, this development-time effort is compensated by the automaticity of performance achievement and the reduced ownership cost. Furthermore, this automaticity enables cache-oblivious techniques to outperform their cacheconscious counterparts in multi-threading processors. [ABSTRACT FROM AUTHOR]
Published: 2008
Full Text: View/download PDF

12. An overview of CMPI.

Author: Yifan Gong, Bingsheng He, and Jianlong Zhong
Published: 2012
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

12 results on '"BINGSHENG HE"'

1. RACE: An Efficient Redundancy-aware Accelerator for Dynamic Graph Neural Network.

2. GraphTune: An Efficient Dependency-Aware Substrate to Alleviate Irregularity in Concurrent Graph Processing.

3. The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems.

4. ByteSeries:An In-Memory Time Series Database for Large-Scale Monitoring Systems

5. Welcome.

6. The Serverless Computing Survey: A Technical Primer for Design Architecture.

7. CGraph: A Distributed Storage and Processing System for Concurrent Iterative Graph Analysis Jobs.

8. RAMZzz.

9. Sensor Placement and Measurement of Wind for Water Quality Studies in Urban Reservoirs.

10. Relational Query Coprocessing on Graphics Processors.

11. Cache-Oblivious Databases: Limitations and Opportunities.

12. An overview of CMPI.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

12 results on '"BINGSHENG HE"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources