Construction of an English-Uyghur WordNet Dataset

Chinese Computational Linguistics, pp. 382-393

Details

Author(s) / Contributors

Title

Construction of an English-Uyghur WordNet Dataset

Is part of

Chinese Computational Linguistics, pp. 382-393

Place / Publisher

Cham: Springer International Publishing

Source

Alma/SFX Local Collection

Descriptions / Notes

Automatically building semantic resources is essential for low-resource languages like Uyghur. However, Uyghur lacks a publicly available evaluation dataset for automatically building semantic resources such as WordNet. To address this problem, we first build the largest Uyghur-English and English-Uyghur dictionaries by exploiting many available online and offline resources. Then, using Princeton WordNet (PWN) 3.0 and the Contemporary Uyghur Detailed Dictionary (CUDD), we construct an English-Uyghur WordNet evaluation dataset, which is publicly available (https://github.com/kaharjan/uywordnet). In this dataset, more than 73,000 English synsets are mapped to Uyghur automatically, of which over 20,000 are annotated manually. The corresponding Uyghur words include definitions and examples in Uyghur-language context. We also propose a Synset Mapping based on Word Embeddings (SMWE) method. The experimental results on the dataset are promising.

Language: English
Identifiers: ISBN: 3030323803, 9783030323806
ISSN: 0302-9743
eISSN: 1611-3349
DOI: 10.1007/978-3-030-32381-3_31
Title ID: cdi_springer_books_10_1007_978_3_030_32381_3_31

Format: –
Keywords: Dataset, Synset mapping, Uyghur, WordNet
