Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Journal of computer science and technology, 2023-12, Vol.38 (6), p.1223-1236
2023
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Visual Topic Semantic Enhanced Machine Translation for Multi-Modal Data Efficiency
Ist Teil von
  • Journal of computer science and technology, 2023-12, Vol.38 (6), p.1223-1236
Ort / Verlag
Singapore: Springer Nature Singapore
Erscheinungsjahr
2023
Quelle
Alma/SFX Local Collection
Beschreibungen/Notizen
  • The scarcity of bilingual parallel corpus imposes limitations on exploiting the state-of-the-art supervised translation technology. One of the research directions is employing relations among multi-modal data to enhance performance. However, the reliance on manually annotated multi-modal datasets results in a high cost of data labeling. In this paper, the topic semantics of images is proposed to alleviate the above problem. First, topic-related images can be automatically collected from the Internet by search engines. Second, topic semantics is sufficient to encode the relations between multi-modal data such as texts and images. Specifically, we propose a visual topic semantic enhanced translation (VTSE) model that utilizes topic-related images to construct a cross-lingual and cross-modal semantic space, allowing the VTSE model to simultaneously integrate the syntactic structure and semantic features. In the above process, topic similar texts and images are wrapped into groups so that the model can extract more robust topic semantics from a set of similar images and then further optimize the feature integration. The results show that our model outperforms competitive baselines by a large margin on the Multi30k and the Ambiguous COCO datasets. Our model can use external images to bring gains to translation, improving data efficiency.
Sprache
Englisch
Identifikatoren
ISSN: 1000-9000
eISSN: 1860-4749
DOI: 10.1007/s11390-023-1302-6
Titel-ID: cdi_proquest_journals_2921193790

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX