Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 21 von 314
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, p.4895-4899
2017

Details

Autor(en) / Beteiligte
Titel
An autoregressive recurrent mixture density network for parametric speech synthesis
Ist Teil von
  • 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, p.4895-4899
Ort / Verlag
IEEE
Erscheinungsjahr
2017
Link zum Volltext
Quelle
IEEE Xplore
Beschreibungen/Notizen
  • Neural-network-based generative models, such as mixture density networks, are potential solutions for speech synthesis. In this paper we follow this path and propose a recurrent mixture density network that incorporates a trainable autoregressive model. An advantage of incorporating an autoregressive model is that the time dependency within acoustic feature trajectories can be modeled without using the conventional dynamic features. More interestingly, experiments show that this autoregressive model learns to be a filter that emphasizes the high frequency components of the target acoustic feature trajectories in the training stage. In the synthesis stage, it boosts the low frequency components of the generated feature trajectories and hence increases their global variance. Experimental results show that the proposed model achieved higher likelihood on the training data and generated speech with better quality than other models when dynamic features were not utilized in any model.
Sprache
Englisch
Identifikatoren
eISSN: 2379-190X
DOI: 10.1109/ICASSP.2017.7953087
Titel-ID: cdi_ieee_primary_7953087

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX