Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 17 von 391
Concurrency and computation, 2020-10, Vol.32 (20), p.n/a
2020
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Heterogeneous computing with OpenMP and Hydra
Ist Teil von
  • Concurrency and computation, 2020-10, Vol.32 (20), p.n/a
Ort / Verlag
Hoboken: Wiley Subscription Services, Inc
Erscheinungsjahr
2020
Quelle
Wiley Blackwell Single Titles
Beschreibungen/Notizen
  • Summary High‐performance computing relies on accelerators (such as GPGPUs) to achieve fast execution of scientific applications. Traditionally, these accelerators have been programmed with specialized languages, such as CUDA or OpenCL. In recent years, OpenMP emerged as a promising alternative for supporting accelerators, providing advantages such as maintaining a single code base for the host and different accelerator types and providing a simple way to extend support for accelerators to existing application codes. Efficiently using this support requires solving several challenges, related to performance, work partitioning, and concurrent execution on multiple device types. In this article, we discuss our experiences with using OpenMP for accelerators and present performance guidelines. We also introduce a library, Hydra, that addresses several of the challenges of using OpenMP for such devices. We apply Hydra to a scientific application, PlasCom2, that has not previously been able to use accelerators. Experiments on three architectures show that Hydra results in performance gains of up to 10× compared with CPU‐only execution. Concurrent execution on the host and GPU resulted in additional gains of up to 20% compared to running on the GPU only.
Sprache
Englisch
Identifikatoren
ISSN: 1532-0626
eISSN: 1532-0634
DOI: 10.1002/cpe.5728
Titel-ID: cdi_osti_scitechconnect_1603688

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX