TY - GEN
T1 - Towards the Transparent Execution of Compound OpenCL Computations in Multi-CPU/Multi-GPU Environments
AU - Soldado, Fabio
AU - Alexandre, Fernando
AU - Paulino, Hervé Miguel Cordeiro
PY - 2014
Y1 - 2014
N2 - Current computational systems are heterogeneous by nature, featuring a combination of CPUs and GPUs. As the latter are becoming an established platform for high-performance computing, the focus is shifting towards the seamless programming of the heterogeneous systems as a whole. The distinct nature of the architectural and execution models in place raise several challenges, as the best hardware configuration is behavior and data-set dependent. In this paper, we focus the execution of compound computations in multi-CPU/multi-GPU environments, in the scope of Marrow algorithmic skeleton framework, the only, to the best of our knowledge, to support skeleton nesting in GPU computing. We address how these computations may be efficiently scheduled onto the target hardware, and how the system may adapt itself to changes in the CPU’s load and in the input data-set.
AB - Current computational systems are heterogeneous by nature, featuring a combination of CPUs and GPUs. As the latter are becoming an established platform for high-performance computing, the focus is shifting towards the seamless programming of the heterogeneous systems as a whole. The distinct nature of the architectural and execution models in place raise several challenges, as the best hardware configuration is behavior and data-set dependent. In this paper, we focus the execution of compound computations in multi-CPU/multi-GPU environments, in the scope of Marrow algorithmic skeleton framework, the only, to the best of our knowledge, to support skeleton nesting in GPU computing. We address how these computations may be efficiently scheduled onto the target hardware, and how the system may adapt itself to changes in the CPU’s load and in the input data-set.
KW - GPU
KW - Skeletons
KW - Heterogeneous computing
U2 - 10.1007/978-3-319-14325-5_16
DO - 10.1007/978-3-319-14325-5_16
M3 - Conference contribution
VL - Pt. 1
T3 - Lecture Notes in Computer Science
SP - 177
EP - 188
BT - EURO-PAR 2014: PARALLEL PROCESSING WORKSHOPS
PB - SPRINGER-VERLAG BERLIN
T2 - Euro-Par 2014 International Workshops
Y2 - 1 January 2014
ER -