2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011, p.1-6
2011

Details

Author(s) / Contributors
Title
Learning distances to improve phoneme classification
Is part of
  • 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011, p.1-6
Place / Publisher
IEEE
Year of publication
2011
Link to full text
Source
IEEE Xplore Digital Library
Descriptions/Notes
  • In this work we aim to learn a Mahalanobis distance to improve the performance of phoneme classification using the standard 39-dimensional MFCC features. To learn and to evaluate the performance of our distance, we use the simple k-nearest-neighbors (k-NN) classifier. Although this classifier exhibits low performance relative to state-of-the-art phoneme classifiers, it can be used to determine a distance metric that is applicable to many other better-performing machine learning methods. We devise a novel optimization method that minimizes the error function of the k-NN classifier with respect to the covariance matrix of the Mahalanobis distance, based on finite-difference stochastic approximation (FDSA) gradient estimates combined with a random perturbation term to avoid local minima. We apply our method to the problem of phoneme classification with the k-NN classifier and show that our learned distance provides a performance improvement of up to 8.19% over the standard k-NN classifier, and additionally outperforms other state-of-the-art distance learning methods by approximately 4 percentage points. We also find that the computational complexity of our method, while not optimal, is better than that of other distance learning methods. The performance improvements for individual phoneme classes are given. The distances learned are applicable to other scale-variant machine learning methods, such as support vector machines, multidimensional scaling, and maximum variance unfolding, as well as others.
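The idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`mahalanobis_sq`, `knn_error`, `fdsa_gradient`, `learn_distance`), the step-size schedule, the noise scale, and the use of a held-out validation set are all assumptions made for illustration. It also parameterizes the distance matrix as `M = L.T @ L` to keep it positive semidefinite, whereas the abstract speaks of optimizing with respect to the covariance matrix directly.

```python
import numpy as np


def mahalanobis_sq(X, Y, M):
    """Squared distances (x - y)^T M (x - y) between rows of X and rows of Y."""
    diff = X[:, None, :] - Y[None, :, :]
    return np.einsum("ijk,kl,ijl->ij", diff, M, diff)


def knn_error(L, X_train, y_train, X_val, y_val, k=3):
    """k-NN error rate on the validation set under M = L^T L.

    Labels are assumed to be non-negative integers.
    """
    M = L.T @ L  # positive semidefinite by construction (assumed parameterization)
    dist = mahalanobis_sq(X_val, X_train, M)
    neighbours = np.argsort(dist, axis=1)[:, :k]
    votes = y_train[neighbours]
    pred = np.array([np.bincount(v).argmax() for v in votes])
    return float(np.mean(pred != y_val))


def fdsa_gradient(f, L, c):
    """Central finite-difference gradient estimate, one matrix entry at a time."""
    grad = np.zeros_like(L)
    for idx in np.ndindex(*L.shape):
        step = np.zeros_like(L)
        step[idx] = c
        grad[idx] = (f(L + step) - f(L - step)) / (2.0 * c)
    return grad


def learn_distance(X_train, y_train, X_val, y_val, k=3,
                   steps=30, a=0.5, c=0.1, noise=0.01, seed=0):
    """Descend the k-NN error with FDSA gradients plus a random perturbation.

    Returns the best matrix M found and its validation error.
    """
    rng = np.random.default_rng(seed)
    L = np.eye(X_train.shape[1])  # start from the Euclidean distance

    def f(L_):
        return knn_error(L_, X_train, y_train, X_val, y_val, k)

    best_L, best_err = L.copy(), f(L)
    for t in range(1, steps + 1):
        grad = fdsa_gradient(f, L, c)
        # Decaying gradient step plus Gaussian noise (hypothetical schedule)
        L = L - (a / t) * grad + noise * rng.standard_normal(L.shape)
        err = f(L)
        if err < best_err:
            best_L, best_err = L.copy(), err
    return best_L.T @ best_L, best_err
```

Because the k-NN error is piecewise constant in the matrix entries, the finite-difference width `c` has to be large enough to change some neighbour rankings, which is one reason a random perturbation term is useful. FDSA needs two error evaluations per matrix entry, so for 39-dimensional MFCC features a full matrix costs 2 × 39² evaluations per step in this sketch.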
Language
English
Identifiers
ISBN: 1457716216, 9781457716218
ISSN: 1551-2541
eISSN: 2378-928X
DOI: 10.1109/MLSP.2011.6064601
Title ID: cdi_ieee_primary_6064601
