Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 17 von 21
2009 Oriental COCOSDA International Conference on Speech Database and Assessments, 2009, p.56-59
2009
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Construction of Chinese conversational corpora for spontaneous speech recognition and comparative study on the trilingual parallel corpora
Ist Teil von
  • 2009 Oriental COCOSDA International Conference on Speech Database and Assessments, 2009, p.56-59
Ort / Verlag
IEEE
Erscheinungsjahr
2009
Quelle
IEEE
Beschreibungen/Notizen
  • In this paper, we describe the development of Chinese conversational segmented and POS-tagged corpora currently used in the NICT/ATR speech-to-speech translation system. Over 500 K manually checked utterances provide 3.5 M words of Chinese corpora. As far as we know, they are the largest conversational textual corpora; in the domain of travel. A set of three parallel corpora is obtained with the corresponding pairs of Japanese and English words from which the Chinese words are translated. Based on these parallel corpora, we make an investigation on the statistics of each language, performances of language model and speech recognition, and find the differences among these languages. The problems and their solutions to the present Chinese corpora are also analyzed and discussed.
Sprache
Englisch
Identifikatoren
DOI: 10.1109/ICSDA.2009.5278375
Titel-ID: cdi_ieee_primary_5278375

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX