Author: "Khunjush, Farshad" / Topic: gpu - Searchworks@Jio Institute Digital Library Search Results

5 results for author "Khunjush, Farshad", topic "gpu"

1. Ignite-GPU: a GPU-enabled in-memory computing architecture on clusters

Author: Sojoodi, Amir Hossein, Salimi Beni, Majid, and Khunjush, Farshad
Published: 2021
Full Text: View/download PDF

2. Adaptive sparse matrix representation for efficient matrix–vector multiplication

Author: Zardoshti, Pantea, Khunjush, Farshad, and Sarbazi-Azad, Hamid
Published: 2016
Full Text: View/download PDF

3. A statistical performance analyzer framework for OpenCL kernels on Nvidia GPUs

Author: Karami, Ali, Khunjush, Farshad, and Mirsoleimani, Seyyed Ali
Published: 2015
Full Text: View/download PDF

4. A two-tier design space exploration algorithm to construct GPU performance model.

Author: Mirsoleimani, S. Ali, Khunjush, Farshad, and Karami, Ali
Subjects: *GRAPHICS processing units, *ALGORITHMS, *PERFORMANCE evaluation, *MATHEMATICAL models, *MICROPROCESSORS, *COMPUTER simulation
Abstract: Graphics Processing Units (GPUs) have a large and complex design space that must be explored in order to optimize the performance of future GPUs. Statistical techniques are useful tools that help computer architects predict the performance of complex processors. In this study, these methods are used to build a model that predicts GPU performance efficiently. The design space of the targeted Fermi GPU has more than 8 million points, which makes exploring it a challenging process. In order to build an accurate model, we propose a two-tier algorithm that builds a multiple linear regression model from a small set of simulated data. In this algorithm, the Plackett–Burman design is used to find the key parameters of the GPU, and further simulations are guided by a fractional factorial design for the most important parameters. Our algorithm is able to construct a GPU performance predictor that can predict the performance of any point in the design space with an average prediction error between 1% and 5% for different benchmark applications. In addition, in comparison to other methods, which need a large number of sampling points, our method achieves this accuracy by sampling only between 0.0003% and 0.0015% of the full design space. [ABSTRACT FROM AUTHOR]
Published: 2015
Full Text: View/download PDF
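
Note: the screening step named in the abstract above can be illustrated with a small sketch. This is not the authors' code. It assumes the standard 12-run Plackett–Burman design for 11 two-level factors, and `toy_simulator` is a hypothetical stand-in for a real GPU simulator. Taking 8 million as the design-space size, the quoted sampling rates of 0.0003% to 0.0015% correspond to roughly 24 to 120 simulated points.

```python
# Minimal Plackett-Burman screening sketch (illustrative only, not the paper's code).

# Standard generator row for the 12-run design with 11 two-level factors.
PB12_GENERATOR = [+1, +1, -1, +1, +1, +1, -1, -1, -1, +1, -1]


def plackett_burman_12():
    """Return the 12 x 11 design: 11 cyclic shifts of the generator plus an all-low run."""
    n = len(PB12_GENERATOR)
    rows = [[PB12_GENERATOR[(j - i) % n] for j in range(n)] for i in range(n)]
    rows.append([-1] * n)
    return rows


def main_effects(design, responses):
    """Effect of factor j = mean response at its high level minus mean at its low level."""
    runs = len(design)
    return [
        2.0 / runs * sum(row[j] * y for row, y in zip(design, responses))
        for j in range(len(design[0]))
    ]


def rank_factors(effects):
    """Factor indices ordered from largest to smallest absolute effect."""
    return sorted(range(len(effects)), key=lambda j: abs(effects[j]), reverse=True)


def toy_simulator(levels):
    """Hypothetical response: only factors 0, 4 and 7 matter (an assumption for the demo)."""
    return 10.0 + 5.0 * levels[0] - 3.0 * levels[4] + 0.1 * levels[7]


design = plackett_burman_12()
responses = [toy_simulator(row) for row in design]
effects = main_effects(design, responses)
ranking = rank_factors(effects)
print("factors ranked by |effect|:", ranking[:3])
```

Because the design columns are balanced and mutually orthogonal, 12 runs are enough to separate the main effects of all 11 factors in this toy setting; a second tier would then spend further simulations only on the top-ranked factors.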

5. A statistical performance prediction model for OpenCL kernels on NVIDIA GPUs.

Author: Karami, Ali, Mirsoleimani, Sayyed Ali, and Khunjush, Farshad
Abstract: Understanding the performance bottlenecks of applications in high-performance computing can lead to dramatic improvements in application performance. For example, a key problem in GPU programming is finding performance bottlenecks and resolving them to reach the best possible performance. These bottlenecks in GPU architectures span a variety of factors, such as memory access latency, branch divergence, utilization, and the amount of available parallelism. In addition, simple profiling cannot reveal the relations between these bottlenecks. In this paper, we propose a statistical performance model that not only helps us find bottlenecks but also shows the relations between them, which is not possible with a profiler. The OpenCL programming standard can be used on a variety of platforms (e.g., CPUs and GPUs); therefore, a program written for one platform can be ported to other platforms with minimal effort. As a result, we selected the OpenCL programming standard in order to design our performance model for NVIDIA GPUs. For this, we first measure the values of GPU performance counters for the selected benchmarks. Then, using these results and applying a regression model and principal component analysis, we develop a model that shows how different GPU parameters account for application performance bottlenecks. Our results show that the proposed model can predict application behavior with 91% accuracy. Moreover, the proposed model is able to characterize unknown applications based on their performance similarities with an existing database of benchmarks in order to predict their likely performance bottlenecks. [ABSTRACT FROM PUBLISHER]
Published: 2013
Full Text: View/download PDF
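
Note: the combination of principal component analysis and regression mentioned in the abstract above can be sketched as follows. This is a simplified illustration, not the authors' model. The counter names, the six rows of values, and the runtimes are synthetic numbers invented for the demo, and only the first principal component is extracted (by power iteration) before fitting a one-variable least-squares line.

```python
import math

# Synthetic, made-up counter values for six hypothetical kernels (not measured data).
# Columns: memory-stall fraction, branch-divergence fraction, occupancy.
counters = [
    [0.10, 0.05, 0.90],
    [0.20, 0.10, 0.80],
    [0.35, 0.20, 0.65],
    [0.50, 0.25, 0.55],
    [0.70, 0.40, 0.30],
    [0.85, 0.45, 0.20],
]
runtime_ms = [1.0, 1.4, 2.1, 2.6, 3.7, 4.2]  # synthetic runtimes


def standardize(rows):
    """Shift each column to mean 0 and scale it to unit sample variance."""
    n, m = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(m)]
    stds = [
        math.sqrt(sum((r[j] - means[j]) ** 2 for r in rows) / (n - 1))
        for j in range(m)
    ]
    return [[(r[j] - means[j]) / stds[j] for j in range(m)] for r in rows]


def covariance(rows):
    """Sample covariance matrix of already-centered rows."""
    n, m = len(rows), len(rows[0])
    return [
        [sum(r[a] * r[b] for r in rows) / (n - 1) for b in range(m)]
        for a in range(m)
    ]


def first_component(cov, iterations=200):
    """Dominant eigenvector of cov, found by power iteration."""
    m = len(cov)
    v = [1.0] + [0.0] * (m - 1)
    for _ in range(iterations):
        w = [sum(cov[a][b] * v[b] for b in range(m)) for a in range(m)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v


def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum(
        (a - mx) ** 2 for a in x
    )
    return my - slope * mx, slope


z = standardize(counters)
cov = covariance(z)
pc1 = first_component(cov)
# Share of total variance captured by the first component.
eigenvalue = sum(pc1[a] * cov[a][b] * pc1[b] for a in range(3) for b in range(3))
explained = eigenvalue / sum(cov[a][a] for a in range(3))
# Project each kernel onto the component and regress runtime on that score.
scores = [sum(zi * vi for zi, vi in zip(row, pc1)) for row in z]
intercept, slope = fit_line(scores, runtime_ms)
predicted = [intercept + slope * s for s in scores]
ss_res = sum((y - p) ** 2 for y, p in zip(runtime_ms, predicted))
ss_tot = sum((y - sum(runtime_ms) / 6) ** 2 for y in runtime_ms)
r_squared = 1.0 - ss_res / ss_tot
print("PC1 loadings:", [round(x, 3) for x in pc1])
```

The signs of the loadings in `pc1` indicate which counters move together: here the two stall-related counters load with one sign and occupancy with the other, which is the kind of relation between bottlenecks that a flat profile does not show directly.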
