Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 21 von 29
EURASIP journal on advances in signal processing, 2016-03, Vol.2016 (1), p.1-21, Article 38
2016
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Instruction scheduling heuristic for an efficient FFT in VLIW processors with balanced resource usage
Ist Teil von
  • EURASIP journal on advances in signal processing, 2016-03, Vol.2016 (1), p.1-21, Article 38
Ort / Verlag
Cham: Springer International Publishing
Erscheinungsjahr
2016
Quelle
Alma/SFX Local Collection
Beschreibungen/Notizen
  • The fast Fourier transform (FFT) is perhaps today’s most ubiquitous algorithm used with digital data; hence, it is still being studied extensively. Besides the benefit of reducing the arithmetic count in the FFT algorithm, memory references and scheme’s projection on processor’s architecture are critical for a fast and efficient implementation. One of the main bottlenecks is in the long latency memory accesses to butterflies’ legs and in the redundant references to twiddle factors. In this paper, we describe a new FFT implementation on high-end very long instruction word (VLIW) digital signal processors (DSP), which presents improved performance in terms of clock cycles due to the resulting low-level resource balance and to the reduced memory accesses of twiddle factors. The method introduces a tradeoff parameter between accuracy and speed. Additionally, we suggest a cache-efficient implementation methodology for the FFT, dependently on the provided VLIW hardware resources and cache structure. Experimental results on a TI VLIW DSP show that our method reduces the number of clock cycles by an average of 51 % (2 times acceleration) when compared to the most assembly-optimized and vendor-tuned FFT libraries. The FFT was generated using an instruction-level scheduling heuristic. It is a modulo-based register-sensitive scheduling algorithm, which is able to compute an aggressively efficient sequence of VLIW instructions for the FFT, maximizing the parallelism rate and minimizing clock cycles and register usage.
Sprache
Englisch
Identifikatoren
ISSN: 1687-6180, 1687-6172
eISSN: 1687-6180
DOI: 10.1186/s13634-016-0336-0
Titel-ID: cdi_proquest_miscellaneous_1808072601

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX