Computational and Applied Mathematics (2020) 39:219
https://doi.org/10.1007/s40314-020-01244-1
Parallel performance analysis of coupled heat and fluid flow
in parallel plate channel using CUDA
Asif Afzal
1
· Zahid Ansari
2
· M. K. Ramis
1
Received: 4 December 2019 / Revised: 18 June 2020 / Accepted: 2 July 2020
© SBMAC - Sociedade Brasileira de Matemática Aplicada e Computacional 2020
Abstract
The heat transfer analysis coupled with fluid flow is important in many real-world application
areas varying from micro-channels to spacecraft’s. Numerical prediction of thermal and fluid
flow situation has become very common method using any computational fluid dynamics
software or by developing in-house codes. One of the major issues pertinent to numerical
analysis lies with immense computational time required for repeated analysis. In this article,
technique applied for parallelization of in-house developed generic code using CUDA and
OpenMP paradigm is discussed. The parallelized finite-volume method (FVM)-based code
for analysis of various problems is analyzed for different boundary conditions. Two GPUs
(graphical processing units) are used for parallel execution. Out of four functions in the code
(U, V , P, and T ), only P function is parallelized using CUDA as it consumes 91% of com-
putational time and the rest functions are parallelized using OpenMP. Parallel performance
analysis is carried out for 400, 625, and 900 threads launched from host for parallel execu-
tion. Improvement in speedup using CUDA compared with speedup using complete OpenMP
parallelization on different computing machines is also provided. Parallel efficiency of the
FVM code for different grid size, Reynolds number, internal flow, and external flow is also
carried out. It is found that the GPU provides immense speedup and outperforms OpenMP
largely. Parallel execution on GPU gives results in a quite acceptable amount of time. The
parallel efficiency is found to be close to 90% in internal flow and 10% for external flow.
Keywords Parallelization · Conjugate heat transfer · CUDA · OpenMP · Speedup · Parallel
efficiency
Mathematics Subject Classification 65K05 · 65Y05
Communicated by Jorge X. Velasco.
B Asif Afzal
asif.afzal86@gmail.com
B M. K. Ramis
ramismk@pace.edu.in
1
Department of Mechanical Engineering, P. A. College of Engineering (Affiliated to Visvesvaraya
Technological University Belagavi), Mangaluru, India
2
Department of Computer Science and Engineering, P. A. College of Engineering (Affiliated to
Visvesvaraya Technological University Belagavi), Mangaluru, India
0123456789().: V,-vol 123