Simultaneous CPU–GPU Execution of Data Parallel Algorithmic Skeletons

Wrede, Fabian; Ernsting, Steffen

Simultaneous CPU–GPU Execution of Data Parallel Algorithmic Skeletons

Wrede Fabian, Ernsting Steffen

Zusammenfassung
Parallel programming has become ubiquitous; however, it is still a low-level and error-prone task, especially when accelerators such as GPUs are used. Thus, algorithmic skeletons have been proposed to provide well-defined programming patterns in order to assist programmers and shield them from low-level aspects. As the complexity of problems, and consequently the need for computing capacity, grows, we have directed our research toward simultaneous CPU-GPU execution of data parallel skeletons to achieve a performance gain. GPUs are optimized with respect to throughput and designed for massively parallel computations. Nevertheless, we analyze whether the additional utilization of the CPU for data parallel skeletons in the Muenster Skeleton Library leads to speedups or causes a reduced performance, because of the smaller computational capacity of CPUs compared to GPUs. We present a C++ implementation based on a static distribution approach. In order to evaluate the implementation, four different benchmarks, including matrix multiplication, N-body simulation, Frobenius norm, and ray tracing, have been conducted. The ratio of CPU and GPU execution has been varied manually to observe the effects of different distributions. The results show that a speedup can be achieved by distributing the execution among CPUs and GPUs. However, both the results and the optimal distribution highly depend on the available hardware and the specific algorithm.

Schlüsselwörter
High-level parallel programming; Data parallel algorithmic skeletons; Simultaneous CPU–GPU execution

Publikationstyp

Forschungsartikel (Zeitschrift)

Begutachtet

Ja

Publikationsstatus

Veröffentlicht

Jahr

2018

Fachzeitschrift

International Journal of Parallel Programming

Band

46

Ausgabe

1

Erste Seite

42

Letzte Seite

61

Sprache

Englisch

ISSN

0885-7458

DOI

https://doi.org/10.1007/s10766-016-0483-9

Gesamter Text

http://link.springer.com/