J Supercomput (2013) 65:1063–1075
DOI 10.1007/s11227-012-0853-z
A preliminary evaluation of OpenACC implementations
Ruymán Reyes · Iván López · Juan J. Fumero ·
Francisco de Sande
Published online: 31 January 2013
© Springer Science+Business Media New York 2013
Abstract During the last few years, the availability of hardware accelerators, such as
GPUs, has rapidly increased. However, the entry cost to GPU programming is high
and requires a considerable porting and tuning effort. Some research groups and ven-
dors have made attempts to ease the situation by defining APIs and languages that
simplify these tasks. In the wake of the success of OpenMP, industria and academia
are working toward defining a new standard of compiler directives to leverage the
GPU programming effort. Support from vendors and similarities with the upcoming
OpenMP 4.0 standard lead us to believe that OpenACC is a good alternative for devel-
opers who want to port existing codes to accelerators. In this paper, we evaluate three
OpenACC implementations: two commercial implementations (PGI and CAPS) and
our own research implementation, accULL, to evaluate the current status and future
directions of the standard.
Keywords OpenACC · PGI · HMPP · accULL · Accelerators · GPGPU · CUDA ·
OpenCL · OpenMP · Compiler · Productivity
1 Introduction
The importance of developer productivity in recent years should not be underesti-
mated [4]. Much research has been conducted on reducing the programming effort
R. Reyes · I. López · J.J. Fumero · F. de Sande ()
Dept. de Estadística, I. O. y Computación, La Laguna, Spain
e-mail: fsande@ull.es
R. Reyes
e-mail: rreyes@ull.es
I. López
e-mail: ilopezro@ull.es
J.J. Fumero
e-mail: jjfumeroa@ull.es