Author: "Laura Carrington" / Publisher: sage publications - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Laura Carrington"' showing total 2 results

Start Over Author "Laura Carrington" Publisher sage publications

2 results on '"Laura Carrington"'

1. Reducing communication in parallel graph search algorithms with software caches

Author: Manu Shantharam, Pietro Cicotti, and Laura Carrington
Subjects: Computer science, business.industry, Distributed computing, Parallel algorithm, Theoretical Computer Science, Software, Hardware and Architecture, Search algorithm, Systems design, Cache, Reference implementation, business, Implementation, Randomness
Abstract: In many scientific and computational domains, graphs are used to represent and analyze data. Such graphs often exhibit the characteristics of small-world networks: few high-degree vertexes connect many low-degree vertexes. Despite the randomness in a graph search, it is possible to capitalize on the characteristics of small-world networks and cache relevant information of high-degree vertexes. We applied this idea by caching remote vertex ids in a parallel breadth-first search benchmark. Our experiment with different implementations demonstrated significant performance improvements over the reference implementation in several configurations, using 64 to 1024 cores. We proposed a system design in which resources are dedicated exclusively to caching and shared among a set of nodes. Our evaluation demonstrates that this design reduces communication and has the potential to improve performance on large-scale systems in which the communication cost increases significantly with the distance between nodes. We also tested a memcached system as the cache server finding that its generic protocol, which does not match our usage semantics, hinders significantly the potential performance improvements and suggested that a generic system should also support a basic and lightweight communication protocol to meet the needs of high-performance computing applications. Finally, we explored different configurations to find efficient ways to utilize the resources allocated to solve a given problem size; to this extent, we found utilizing half of the compute cores per allocated node improves performance, and even in this case, caching variants always outperform the reference implementation.
Published: 2018
Full Text: View/download PDF

2. Modeling and predicting performance of high performance computing applications on hardware accelerators

Author: Mitesh R. Meswani, Laura Carrington, Scott B. Baden, Allan Snavely, Stephen W. Poole, and Didem Unat
Subjects: TOP500, Speedup, Computer science, business.industry, Graphics processing unit, Workload, Parallel computing, Supercomputer, Porting, Theoretical Computer Science, Hardware and Architecture, Performance prediction, Central processing unit, business, Field-programmable gate array, Software, Computer hardware
Abstract: Computers with hardware accelerators, also referred to as hybrid-core systems, speedup applications by offloading certain compute operations that can run faster on accelerators. Thus, it is not surprising that many of top500 supercomputers use accelerators. However, in addition to procurement cost, significant programming and porting effort is required to realize the potential benefit of such accelerators. Hence, before building such a system it is prudent to answer the question 'what is the projected performance benefit from accelerators for the workloads of interest?'. We address this question by way of a performance-modeling framework that predicts realizable application performance on accelerators rapidly and accurately without going to the considerable effort of porting and tuning. The modeling framework first automatically identifies commonly found compute patterns in scientific applications which we term idioms, which may benefit by accelerator technology. Next the framework models the predicted speedup of those idioms if they were to be ported to and run on hardware accelerators. As a proof of concept we characterize two kinds of accelerators 1) the FPGA accelerators on a Convey HC-1 system and 2) an NVIDIA FERMI GPU accelerator. We model performance of the idioms gather/scatter and stream and our predictions show that where these occur in two full-scale HPC applications, Milc and HYCOM, gather/scatter speeds up by as much as 15X, and stream by as much as 14X, whereas the overall compute time of Milc improves by 3.4% and HYCOM by 20%.
Published: 2012
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

2 results on '"Laura Carrington"'

1. Reducing communication in parallel graph search algorithms with software caches

2. Modeling and predicting performance of high performance computing applications on hardware accelerators

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

2 results on '"Laura Carrington"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources