Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 12 von 15
2013 International Conference on Soft Computing and Pattern Recognition (SoCPaR), 2013, p.31-36
2013
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Identifying coordinated compound words for Vietnamese word segmentation
Ist Teil von
  • 2013 International Conference on Soft Computing and Pattern Recognition (SoCPaR), 2013, p.31-36
Ort / Verlag
IEEE
Erscheinungsjahr
2013
Quelle
IEEE Xplore
Beschreibungen/Notizen
  • This paper proposes a dictionary-based method for determining coordinated compound words in Vietnamese. The main idea to determine whether two contiguous simple words in a text forms a coordinated compound word is based on their properties, part-of-speeches and the similarity between their definitions in the dictionary of the Vietnamese Computational Lexicon (VCL). We also based on the sets of synonym and antonym to identify, recognize, and establish a list of coordinated compound words (coordinated di-syllable phrases). We have used a number of rules to identify 3 or 4 syllable phrases/idioms based on relations of coordinated di-syllable phrases. We carried out two major experiments: one for identifying and creating a list of coordinated compounds, the other for improving the accuracy of Vietnamese word segmentation. The second experiment showed that the word segmentation F-scores increases from 0.11% to 0.41% (the error rate decreases from 3.32% to 12.6%). This is a new approach and highly practical value.
Sprache
Englisch
Identifikatoren
DOI: 10.1109/SOCPAR.2013.7054145
Titel-ID: cdi_ieee_primary_7054145

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX