Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 5 von 23
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, p.19027-19036
2023
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Discovering the Real Association: Multimodal Causal Reasoning in Video Question Answering
Ist Teil von
  • 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, p.19027-19036
Ort / Verlag
IEEE
Erscheinungsjahr
2023
Quelle
IEEE Electronic Library (IEL)
Beschreibungen/Notizen
  • Video Question Answering (VideoQA) is challenging as it requires capturing accurate correlations between modalities from redundant information. Recent methods focus on the explicit challenges of the task, e.g. multimodal feature extraction, video-text alignment and fusion. Their frameworks reason the answer relying on statistical evidence causes, which ignores potential bias in the multimodal data. In our work, we investigate relational structure from a causal representation perspective on multimodal data and propose a novel inference framework. For visual data, question-irrelevant objects may establish simple matching associations with the answer. For textual data, the model prefers the local phrase semantics which may deviate from the global semantics in long sentences. Therefore, to enhance the generalization of the model, we discover the real association by explicitly capturing visual features that are causally related to the question semantics and weakening the impact of local language semantics on question answering. The experimental results on two large causal VideoQA datasets verify that our proposed framework 1) improves the accuracy of the existing VideoQA backbone, 2) demonstrates robustness on complex scenes and questions. The code will be released at https://github.com/Chuanqi-Zang/Discovering-the-Real-Association.
Sprache
Englisch
Identifikatoren
eISSN: 2575-7075
DOI: 10.1109/CVPR52729.2023.01824
Titel-ID: cdi_ieee_primary_10204968

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX