Disambiguating Visual Verbs

IEEE transactions on pattern analysis and machine intelligence, 2019-02, Vol.41 (2), p.311-322

2019

Details

Author(s) / Contributors

Title

Disambiguating Visual Verbs

Is part of

IEEE transactions on pattern analysis and machine intelligence, 2019-02, Vol.41 (2), p.311-322

Place / Publisher

United States: IEEE

Year of publication

2019

Link to full text

Full text

Source

IEEE Xplore

Descriptions/Notes

In this article, we introduce a new task, visual sense disambiguation for verbs: given an image and a verb, assign the correct sense of the verb, i.e., the one that describes the action depicted in the image. Just as textual word sense disambiguation is useful for a wide range of NLP tasks, visual sense disambiguation can be useful for multimodal tasks such as image retrieval, image description, and text illustration. We introduce a new dataset, VerSe (short for Verb Sense), which augments existing multimodal datasets (COCO and TUHOI) with verb and sense labels. We explore supervised and unsupervised models for the sense disambiguation task using textual, visual, and multimodal embeddings. We also consider a scenario in which we must detect the verb depicted in an image prior to predicting its sense (i.e., there is no verbal information associated with the image). We find that textual embeddings perform well when gold-standard annotations (object labels and image descriptions) are available, while multimodal embeddings perform well on unannotated images. VerSe is publicly available at https://github.com/spandanagella/verse.

Language: English
Identifiers: ISSN: 0162-8828
eISSN: 1939-3539
DOI: 10.1109/TPAMI.2017.2786699
Title ID: cdi_ieee_primary_8240977

Keywords: Annotations, Bicycles, Computer vision, distributed representations, Horses, Image detection, Image management, Image recognition, Image retrieval, Labels, Natural language processing, Retrieval, Semantics, Visual tasks, Visualization, Word sense disambiguation
