Author: "Zhao, Rong-Cai" / Topic: simd - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Zhao, Rong-Cai"' showing total 3 results

1. Outer-Loop Auto-Vectorization for SIMD Architectures Based on Open64 Compiler

Author: Zhao Rong-cai, Li Yingying, Wang Qi, and Wang Dong
Subjects: Computer science, Loop inversion, Data parallelism, Loop fusion, 020208 electrical & electronic engineering, 02 engineering and technology, Parallel computing, 020202 computer hardware & architecture, Loop fission, 0202 electrical engineering, electronic engineering, information engineering, Loop interchange, SIMD, Inner loop, Loop dependence analysis
Abstract: SIMD (Single Instruction, Multiple Data) extensions are acceleration components integrated into general-purpose processors to exploit the instruction-level and data-level parallelism of multimedia and scientific computing programs. Most automatic vectorization methods for SIMD architectures target innermost loops; inner-loop vectorization has been used for many years and its effectiveness is widely accepted. In this paper, we propose outer-loop vectorization, which vectorizes the outer loop directly, as a better method for some loop nests. For those loop nests it can extract more data-level parallelism and make better use of spatial locality than inner-loop vectorization, improving program efficiency. We first revisit the preliminary analysis of outer-loop vectorization in the Open64 compiler, then present data type conversion and code generation in detail. Finally, we propose two optimization methods that boost the performance of outer-loop vectorization, achieving an acceleration of 20% on average and 50% at most.
Published: 2016
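
The contrast between inner-loop and outer-loop vectorization described in this abstract can be illustrated with a small sketch. It is not the paper's Open64 implementation: it only simulates SIMD lanes with Python lists, and the vector length `VL`, the function names, and the loop nest itself are assumptions chosen for illustration. In this nest the inner loop over `j` is a reduction with stride-`n` accesses to `b`, while the outer loop over `i` touches contiguous elements of `b[j]`, so strip-mining the outer loop lets each simulated vector operation read a contiguous slice.

```python
VL = 4  # assumed number of SIMD lanes (hypothetical, not taken from the paper)


def scalar_nest(b, c):
    """Reference loop nest: a[i] = sum over j of b[j][i] * c[j]."""
    m, n = len(b), len(b[0])
    a = [0.0] * n
    for i in range(n):          # outer loop
        for j in range(m):      # inner loop (a reduction into a[i])
            a[i] += b[j][i] * c[j]
    return a


def outer_loop_vectorized(b, c):
    """Same computation with the OUTER loop strip-mined by VL.

    Each list of length VL stands in for one vector register.
    """
    m, n = len(b), len(b[0])
    a = [0.0] * n
    main = n - n % VL                       # iterations covered by full vectors
    for i0 in range(0, main, VL):
        acc = [0.0] * VL                    # vector accumulator for a[i0:i0+VL]
        for j in range(m):
            lanes = b[j][i0:i0 + VL]        # contiguous "vector load"
            acc = [x + y * c[j] for x, y in zip(acc, lanes)]  # c[j] is broadcast
        a[i0:i0 + VL] = acc                 # contiguous "vector store"
    for i in range(main, n):                # scalar epilogue for leftover iterations
        for j in range(m):
            a[i] += b[j][i] * c[j]
    return a
```

Because every lane accumulates its terms in the same order as the scalar loop, the two functions return identical results; a real compiler would additionally have to prove that the outer loop carries no dependence before applying the transformation.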

2. Recovery Methodology to Avoid Loss for SLP

Author: Zhao Rong-cai, Wei Shuai, and Yao Yuan
Subjects: Speedup, Computer science, General Medicine, Parallel computing, computer.software_genre, SIMD, SLP algorithm, vectorization, Image tracing, Compiler, Hardware_CONTROLSTRUCTURESANDMICROPROGRAMMING, computer, Engineering(all), data dependence analysis
Abstract: More and more processors integrate SIMD extensions, and many compilers now apply auto-vectorization. SLP is a vectorization algorithm that can vectorize scientific applications more effectively than traditional algorithms. However, if SLP does not vectorize basic blocks efficiently, vectorization performance degrades. To solve this problem, this paper presents SLP with a recovery methodology. The algorithm first vectorizes the program with SLP, then estimates the vectorization benefit using a cost model, and finally restores the basic blocks that were not vectorized efficiently to their original state. Experimental results indicate that with the new policy, the speedup gain for some applications can reach 29.4%.
Published: 2011
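
The three steps named in this abstract (vectorize, estimate benefit, recover unprofitable blocks) can be sketched as a simple selection pass. This is a minimal illustration, not the authors' algorithm: the `Block` record, its cost fields, and the rule "keep the vector version only if the estimated benefit is positive" are all assumptions, and the cycle counts are made-up inputs.

```python
from dataclasses import dataclass


@dataclass
class Block:
    """Hypothetical summary of one basic block after an SLP pass."""
    name: str
    scalar_cost: int       # estimated cost of the original scalar code
    vector_cost: int       # estimated cost of the vectorized statements
    pack_unpack_cost: int  # estimated overhead of packing/unpacking operands


def estimated_benefit(block):
    """Positive when the vectorized block is expected to be cheaper."""
    return block.scalar_cost - (block.vector_cost + block.pack_unpack_cost)


def select_versions(blocks):
    """Keep the vector version where it pays off, otherwise recover the scalar one."""
    return {
        b.name: "vector" if estimated_benefit(b) > 0 else "scalar"
        for b in blocks
    }
```

For example, `select_versions([Block("bb1", 40, 12, 8), Block("bb2", 20, 10, 16)])` keeps `bb1` vectorized and recovers `bb2`, whose packing overhead outweighs the gain.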

3. An open64-based cost analytical model in auto-vectorization

Author: Zhao Rong-cai and Zhang Yuanyuan
Subjects: Parallel processing (DSP implementation), Computer science, Vectorization (mathematics), Optimizing compiler, Parallel computing, Compiler, SIMD, Activity-based costing, computer.software_genre, Storage management, computer, Field (computer science)
Abstract: Non-contiguous memory references and misaligned memory accesses can have a great impact on program performance in auto-vectorization, and different target architectures may affect vectorization performance differently. Multimedia extensions, a popular technology in recent years, are important in the vectorization field. With support from special processing units in microprocessors, SIMD automatic vectorization has become available, and compilers targeting SIMD are widely used in research. This article proposes an analytical cost model for an automatic vectorization compiler. Based on an analysis of several important factors that affect performance, the model, combined with the SLP technique, evaluates both the benefit and the cost of vectorization and serves as a guide for vectorization. Experimental results indicate that, to some extent, the model can accurately predict vectorization benefits and guide compiler optimization.
Published: 2010
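
To make the two factors this abstract singles out (non-contiguous references and misalignment) concrete, the sketch below charges each vector memory access according to its stride and alignment and compares the total with the scalar cost. The constants, function names, and the one-unit-per-operation scalar model are hypothetical placeholders, not values or formulas from the article; a real model would be calibrated per target architecture.

```python
VL = 4                       # assumed vector length in elements
VECTOR_LOAD_COST = 1         # assumed cost of one aligned, contiguous vector access
MISALIGN_PENALTY = 2         # assumed extra cost when the access is misaligned
GATHER_COST_PER_ELEMENT = 2  # assumed cost per element for a non-contiguous access


def vector_access_cost(stride, aligned, vl=VL):
    """Estimated cost of one vector memory access."""
    if stride != 1:
        # Non-contiguous: elements are assumed to be assembled one by one.
        return vl * GATHER_COST_PER_ELEMENT
    return VECTOR_LOAD_COST + (0 if aligned else MISALIGN_PENALTY)


def estimated_benefit(accesses, scalar_ops, vl=VL):
    """Scalar cost of vl iterations minus the cost of one vector iteration.

    accesses is a list of (stride, aligned) pairs; every scalar operation
    and scalar memory access is assumed to cost one unit.
    """
    scalar_cost = vl * (scalar_ops + len(accesses))
    vector_cost = scalar_ops + sum(
        vector_access_cost(stride, aligned, vl) for stride, aligned in accesses
    )
    return scalar_cost - vector_cost
```

Under these assumed weights, a loop body with two contiguous aligned accesses shows a clear benefit, misalignment shrinks it, and stride-2 accesses turn it negative, which is the kind of signal a compiler could use to decide whether to vectorize.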
