UB Paderborn / Katalog / Suche / Details

Ergebnis 1 von 35

Artificial Intelligence and Machine Learning, 2023, Vol.1805, p.167-177

2023

Autor(en) / Beteiligte

Titel

Examining Speaker and Keyword Uniqueness: Partitioning Keyword Spotting Datasets for Federated Learning with the Largest Differencing Method

Ist Teil von

Ort / Verlag

Switzerland: Springer International Publishing AG

Erscheinungsjahr

2023

Link zum Volltext

Quelle

Alma/SFX Local Collection

Beschreibungen/Notizen

Federated learning is a powerful training strategy for neural networks where several independent clients train a model without the need of sharing potentially sensitive data. However, real world client-local data is usually biased: A single client might have access to only a few lighting conditions in computer visions, patient groups in a hospital or speakers and keywords in a smart device performing keyword spotting. We help researchers to better understand and estimate the expected performance impacts by introducing a new method to partition a given dataset into an arbitrary amount of clients, each with unique properties, to simulate such conditions. We apply the method to partition the Google Speech Command dataset into clients with non-overlapping speakers and additionally unique keywords and share the script to create the novel GSC-FL dataset. The results, using convolutional neural networks, show that the performance of the final model is stable up to at least 16 clients and models trained only on local data are clearly outperformed by federated learning. However, unique speakers for each client have a negative performance impact and it increases even more with unique keywords. Our script can be applied with only minor adjustments to partition any other dataset for federated learning investigations as well.

Sprache: Englisch
Identifikatoren: ISBN: 3031391438, 9783031391439
ISSN: 1865-0929
eISSN: 1865-0937
DOI: 10.1007/978-3-031-39144-6_11
Titel-ID: cdi_springer_books_10_1007_978_3_031_39144_6_11

Format: –
Schlagworte: deep learning, federated learning, keyword spotting, multiway number partitioning, speech recognition

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX