Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
ACM transactions on programming languages and systems, 2008-05, Vol.30 (3), p.1-43
2008

Details

Autor(en) / Beteiligte
Titel
PEAK-a fast and effective performance tuning system via compiler optimization orchestration
Ist Teil von
  • ACM transactions on programming languages and systems, 2008-05, Vol.30 (3), p.1-43
Ort / Verlag
ACM
Erscheinungsjahr
2008
Link zum Volltext
Quelle
Alma/SFX Local Collection
Beschreibungen/Notizen
  • Compile-time optimizations generally improve program performance. Nevertheless, degradations caused by individual compiler optimization techniques are to be expected. Feedback-directed optimization orchestration systems generate optimized code versions under a series of optimization combinations, evaluate their performance, and search for the best version. One challenge to such systems is to tune program performance quickly in an exponential search space. Another challenge is to achieve high program performance, considering that optimizations interact. Aiming at these two goals, this article presents an automated performance tuning system, PEAK , which searches for the best compiler optimization combinations for the important code sections in a program. The major contributions made in this work are as follows: (1) An algorithm called Combined Elimination (CE) is developed to explore the optimization space quickly and effectively; (2) Three fast and accurate rating methods are designed to evaluate the performance of an optimized code section based on a partial execution of the program; (3) An algorithm is developed to identify important code sections as candidates for performance tuning by trading off tuning speed and tuned program performance; and (4) A set of compiler tools are implemented to automate optimization orchestration. Orchestrating optimization options in SUN Forte compilers at the whole-program level, our CE algorithm improves performance by 10.8% over the SPEC CPU2000 FP baseline setting, compared to 5.6% improved by manual tuning. Orchestrating GCC O3 optimizations, CE improves performance by 12% over O3, the highest optimization level. Applying the rating methods, PEAK reduces tuning time from 2.19 hours to 5.85 minutes on average, while achieving equal or better program performance.
Sprache
Englisch
Identifikatoren
ISSN: 0164-0925
eISSN: 1558-4593
DOI: 10.1145/1353445.1353451
Titel-ID: cdi_crossref_primary_10_1145_1353445_1353451

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX