UB Paderborn / Katalog / Suche / Details

Zur Ergebnisliste

Ergebnis 14 von 562

Contextual Translation Embedding for Visual Relationship Detection and Scene Graph Generation

IEEE transactions on pattern analysis and machine intelligence, 2021-11, Vol.43 (11), p.3820-3832

2021

Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte

Titel

Contextual Translation Embedding for Visual Relationship Detection and Scene Graph Generation

Ist Teil von

IEEE transactions on pattern analysis and machine intelligence, 2021-11, Vol.43 (11), p.3820-3832

Ort / Verlag

United States: IEEE

Erscheinungsjahr

2021

Quelle

IEEE Electronic Library (IEL)

Beschreibungen/Notizen

Relations amongst entities play a central role in image understanding. Due to the complexity of modeling ( subject , predicate , object ) relation triplets, it is crucial to develop a method that can not only recognize seen relations, but also generalize to unseen cases. Inspired by a previously proposed visual translation embedding model, or VTransE <xref ref-type="bibr" rid="ref1">[1] , we propose a context-augmented translation embedding model that can capture both common and rare relations. The previous VTransE model maps entities and predicates into a low-dimensional embedding vector space where the predicate is interpreted as a translation vector between the embedded features of the bounding box regions of the subject and the object . Our model additionally incorporates the contextual information captured by the bounding box of the union of the subject and the object, and learns the embeddings guided by the constraint predicate <inline-formula><tex-math notation="LaTeX">\approx</tex-math> <mml:math><mml:mo>≈</mml:mo></mml:math><inline-graphic xlink:href="hung-ieq1-2992222.gif"/> </inline-formula> union ( subject , object ) <inline-formula><tex-math notation="LaTeX">-</tex-math> <mml:math><mml:mo>-</mml:mo></mml:math><inline-graphic xlink:href="hung-ieq2-2992222.gif"/> </inline-formula> subject <inline-formula><tex-math notation="LaTeX">-</tex-math> <mml:math><mml:mo>-</mml:mo></mml:math><inline-graphic xlink:href="hung-ieq3-2992222.gif"/> </inline-formula> object . In a comprehensive evaluation on multiple challenging benchmarks, our approach outperforms previous translation-based models and comes close to or exceeds the state of the art across a range of settings, from small-scale to large-scale datasets, from common to previously unseen relations. It also achieves promising results for the recently introduced task of scene graph generation.

Sprache: Englisch
Identifikatoren: ISSN: 0162-8828
eISSN: 1939-3539, 2160-9292
DOI: 10.1109/TPAMI.2020.2992222
Titel-ID: cdi_ieee_primary_9085893

Format: –
Schlagworte: Bicycles, Embedding, Feature extraction, Image edge detection, scene graph generation, scene understanding, Semantics, Task analysis, Training, Visual relationship detection, Visualization

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX

Menü

Weitere Dienste

Einstellungen

Contextual Translation Embedding for Visual Relationship Detection and Scene Graph Generation

Details

Weiterführende Literatur