UB Paderborn / Katalog / Suche / Details

Ergebnis 13 von 25744

Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...

Domain-specific acceleration and auto-parallelization of legacy scientific code in FORTRAN 77 using source-to-source compilation

Computers & fluids, 2018-09, Vol.173, p.1-5

2018

Details

Autor(en) / Beteiligte

Titel

Domain-specific acceleration and auto-parallelization of legacy scientific code in FORTRAN 77 using source-to-source compilation

Ist Teil von

Computers & fluids, 2018-09, Vol.173, p.1-5

Ort / Verlag

Amsterdam: Elsevier Ltd

Erscheinungsjahr

2018

Link zum Volltext

Quelle

Elsevier ScienceDirect Journals Complete

Beschreibungen/Notizen

•Accelerators (GPGPUs, manycores, FPGAs) are powerful but porting code is very hard.•We automatically transform F77 code into GPU-accelerated programs using OpenCL.•Our compiler creates modern, acceleration-ready Fortran 95 from legacy FORTRAN 77.•Our compiler further creates OpenCL code with auto-parallelized kernels.•The performance of the automatically OpenC code on GPU is as good as handported code. Massively parallel accelerators such as GPGPUs, manycores and FPGAs represent a powerful and affordable tool for scientists who look to speed up simulations of complex systems. However, porting code to such devices requires a detailed understanding of heterogeneous programming tools and effective strategies for parallelization. In this paper we present a source to source compilation approach with whole-program analysis to automatically transform single-threaded FORTRAN 77 legacy code into OpenCL-accelerated programs with parallelized kernels. The main contributions of our work are: (1) whole-source refactoring to allow any subroutine in the code to be offloaded to an accelerator. (2) Minimization of the data transfer between the host and the accelerator by eliminating redundant transfers. (3) Pragmatic auto-parallelization of the code to be offloaded to the accelerator by identification of parallelizable maps and reductions. We have validated the code transformation performance of the compiler on the NIST FORTRAN 78 test suite and several real-world codes: the Large Eddy Simulator for Urban Flows, a high-resolution turbulent flow model; the shallow water component of the ocean model Gmodel; the Linear Baroclinic Model, an atmospheric climate model and Flexpart-WRF, a particle dispersion simulator. The automatic parallelization component has been tested on as 2-D Shallow Water model (2DSW) and on the Large Eddy Simulator for Urban Flows (UFLES) and produces a complete OpenCL-enabled code base. The fully OpenCL-accelerated versions of the 2DSW and the UFLES are resp. 9x and 20x faster on GPU than the original code on CPU, in both cases this is the same performance as manually ported code.

Sprache: Englisch
Identifikatoren: ISSN: 0045-7930
eISSN: 1879-0747
DOI: 10.1016/j.compfluid.2018.06.005
Titel-ID: cdi_proquest_journals_2114220878

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX

Menü

Weitere Dienste

Einstellungen

Domain-specific acceleration and auto-parallelization of legacy scientific code in FORTRAN 77 using source-to-source compilation

Details

Weiterführende Literatur