2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), 2000, Vol.3, p.1299-1302 vol.3
2000

Details

Author(s) / Contributors
Title
Speech reconstruction from mel frequency cepstral coefficients and pitch frequency
Is part of
  • 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), 2000, Vol.3, p.1299-1302 vol.3
Place / Publisher
IEEE
Year of publication
2000
Source
IEEE Electronic Library Online
Descriptions/Notes
  • This paper presents a novel low-complexity, frequency-domain algorithm for reconstruction of speech from the mel-frequency cepstral coefficients (MFCC), commonly used by speech recognition systems, and the pitch frequency values. The reconstruction technique is based on the sinusoidal speech representation. A set of sine-wave frequencies is derived using the pitch frequency and voicing decisions, and synthetic phases are then assigned to each respective sine wave. The sine-wave amplitudes are generated by sampling a linear combination of frequency-domain basis functions. The basis function gains are determined such that the mel-frequency binned spectrum of the reconstructed speech is similar to the mel-frequency binned spectrum obtained from the original MFCC vector by IDCT and antilog operations. Natural-sounding, good-quality, intelligible speech is obtained by this procedure.
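
    The building blocks named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: all function names are hypothetical, the MFCC front-end is assumed to use an unnormalized DCT-II of natural-log mel bin values, and a voiced frame is assumed to place its sine waves at pitch harmonics strictly below the Nyquist frequency. The fitting of basis-function gains and the synthetic phase model described in the abstract are not reproduced here; amplitudes and phases are simply passed in.

```python
import math

def dct2(x, num_coeffs):
    """Unnormalized DCT-II, keeping the first num_coeffs coefficients
    (assumed front-end convention, used here only to build test input)."""
    n = len(x)
    return [sum(x[j] * math.cos(math.pi * k * (j + 0.5) / n) for j in range(n))
            for k in range(num_coeffs)]

def mel_bins_from_mfcc(mfcc, num_bins):
    """IDCT followed by antilog: MFCC vector -> mel-binned spectrum estimate.
    Exact when len(mfcc) == num_bins, a smoothed estimate when truncated."""
    bins = []
    for j in range(num_bins):
        log_value = mfcc[0] / num_bins
        for k in range(1, len(mfcc)):
            log_value += (2.0 / num_bins) * mfcc[k] * math.cos(
                math.pi * k * (j + 0.5) / num_bins)
        bins.append(math.exp(log_value))  # antilog (natural log assumed)
    return bins

def harmonic_frequencies(f0_hz, sample_rate_hz):
    """Sine-wave frequencies for a voiced frame: pitch harmonics
    strictly below the Nyquist frequency (an assumption of this sketch)."""
    nyquist = sample_rate_hz / 2.0
    freqs = []
    h = 1
    while f0_hz * h < nyquist:
        freqs.append(f0_hz * h)
        h += 1
    return freqs

def synthesize_frame(freqs_hz, amps, phases, num_samples, sample_rate_hz):
    """Sinusoidal model: sum of cosines with given amplitudes and phases."""
    return [sum(a * math.cos(2.0 * math.pi * f * t / sample_rate_hz + p)
                for f, a, p in zip(freqs_hz, amps, phases))
            for t in range(num_samples)]
```

    With as many cepstral coefficients as mel bins, `mel_bins_from_mfcc` inverts `dct2` exactly; with a truncated MFCC vector, as is typical in recognition front-ends, it yields only a smoothed approximation of the binned spectrum, which is the target the abstract says the reconstructed speech is matched against.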
Language
English
Identifiers
ISBN: 9780780362932, 0780362934
ISSN: 1520-6149
eISSN: 2379-190X
DOI: 10.1109/ICASSP.2000.861816
Title ID: cdi_ieee_primary_861816
