

Proceedings of 2002 IEEE Workshop on Speech Synthesis, 2002, p. 155-158

2002


Author(s) / Contributors

Title

Statistic prosody structure prediction

Is part of

Place / Publisher

IEEE

Year of publication

2002

Source

IEEE Electronic Library (IEL)

Descriptions / Notes

Hierarchical prosody structure generation is a key component of a speech synthesis system. This paper presents a statistical method that predicts the prosody structure for a Chinese text-to-speech (TTS) system by combining a dynamic programming method with rules. The method is based on a manually annotated corpus extracted from natural speech (IBM Mandarin TTS Corpus for Female 02). The experimental results show that an accuracy of 91.2% in predicting prosodic structure can be achieved. A state-of-the-art Mandarin TTS system was built on the hierarchical prosody structure. Listening tests show that the prosody structure works well.

Language: English
Identifiers: ISBN: 0780373952, 9780780373952
DOI: 10.1109/WSS.2002.1224397
Title ID: cdi_ieee_primary_1224397

Keywords: Buildings, Data mining, Electronic mail, Natural languages, Rhythm, Speech synthesis, Statistics, Tagging, Testing, Training data
