Back to Search
Start Over
An Asynchronous Parallel Implementation of Multilevel Fast Multipole Algorithm on GPU Cluster for 3D Electromagnetic Scattering Problems
- Source :
- 2021 International Applied Computational Electromagnetics Society (ACES-China) Symposium.
- Publication Year :
- 2021
- Publisher :
- IEEE, 2021.
-
Abstract
- This paper presents a CPU/GPU asynchronous computing pattern based improved parallel multilevel fast multipole algorithm (MLFMA) for 3D electromagnetic scattering problems on GPU Cluster. In the presented parallel implementation, the matrix assembly process of the MLFMA is decomposed into CPU execution and GPU execution two parts. The former is performed on CPU using OpenMP multi-threading programming model, while the latter is performed on GPU with CUDA programming model. The execution time between the two parts is overlapped by using the feature of asynchronous execution between CPU and GPU. The performance of the proposed parallel implementation is investigated in terms of accuracy and efficiency. Numerical results show that, with the proposed parallel approach, over 10% speed-up can be attained, compared with the original parallel implementation.
- Subjects :
- Computer Science::Performance
Matrix (mathematics)
Computer science
Asynchronous communication
Feature (computer vision)
Computer Science::Mathematical Software
Programming paradigm
Process (computing)
Computational electromagnetics
GPU cluster
Multipole expansion
Computer Science::Operating Systems
Algorithm
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2021 International Applied Computational Electromagnetics Society (ACES-China) Symposium
- Accession number :
- edsair.doi...........003e0ced6b8369e0610433dc9aefba27