UB Paderborn / Katalog / Suche / Details

Ergebnis 23 von 3405

The Journal of supercomputing, 2020-03, Vol.76 (3), p.2039-2062

2020

Autor(en) / Beteiligte

Titel

Using Arm’s scalable vector extension on stencil codes

Ist Teil von

Ort / Verlag

New York: Springer US

Erscheinungsjahr

2020

Link zum Volltext

Quelle

Alma/SFX Local Collection

Beschreibungen/Notizen

Data-level parallelism is frequently ignored or underutilized. Achieved through vector/SIMD capabilities, it can provide substantial performance improvements on top of widely used techniques such as thread-level parallelism. However, manual vectorization is a tedious and costly process that needs to be repeated for each specific instruction set or register size. In addition, automatic compiler vectorization is susceptible to code complexity, and usually limited due to data and control dependencies. To address some of these issues, Arm recently released a new vector ISA, the scalable vector extension (SVE), which is vector-length agnostic (VLA). VLA enables the generation of binary files that run regardless of the physical vector register length. In this paper, we leverage the main characteristics of SVE to implement and optimize stencil computations, ubiquitous in scientific computing. We show that SVE enables easy deployment of textbook optimizations like loop unrolling, loop fusion, load trading or data reuse. Our detailed simulations using vector lengths ranging from 128 to 2048 bits show that these optimizations can lead to performance improvements over straightforward vectorized code of up to 1.57 × . In addition, we show that certain optimizations can hurt performance due to reduced arithmetic intensity and instruction overheads, and provide insight useful for compiler optimizers.

Sprache: Englisch
Identifikatoren: ISSN: 0920-8542
eISSN: 1573-0484
DOI: 10.1007/s11227-019-02842-5
Titel-ID: cdi_csuc_recercat_oai_recercat_cat_2072_362863

Format: –
Schlagworte: Arquitectura de computadors, Compiladors (Programes d'ordinador), Compilers, Compilers (Computer programs), Computer Science, Computer simulation, Data-level parallelism, Informàtica, Informàtica ubiqua, Interpreters, Processor Architectures, Programming Languages, Scalable vector extension, Stencil computations, Ubiquitous computing, Vector processing (computers), Vector-length agnostic, Àrees temàtiques de la UPC

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX