Computer Vision – ECCV 2018, pp. 639-655

Author(s) / Contributors

Title

In the Eye of Beholder: Joint Learning of Gaze and Actions in First Person Video

Is part of

Place / Publisher

Cham: Springer International Publishing

Source

Alma/SFX Local Collection

Descriptions / Notes

We address the task of jointly determining what a person is doing and where they are looking based on the analysis of video captured by a headworn camera. We propose a novel deep model for joint gaze estimation and action recognition in First Person Vision. Our method describes the participant’s gaze as a probabilistic variable and models its distribution using stochastic units in a deep network. We sample from these stochastic units to generate an attention map. This attention map guides the aggregation of visual features in action recognition, thereby providing coupling between gaze and action. We evaluate our method on the standard EGTEA dataset and demonstrate performance that exceeds the state-of-the-art by a significant margin of 3.5%.
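
The sampling and aggregation steps described in the abstract can be illustrated with a small, self-contained sketch. This is not the authors' implementation: the abstract does not say how the stochastic units are sampled, so the Gumbel-softmax relaxation used here is an assumption, and the function names, the four-cell grid, the logits and the feature vectors are all hypothetical values chosen for illustration.

```python
import math
import random


def sample_attention_map(logits, rng, temperature=1.0):
    """Draw one stochastic attention map over spatial cells.

    Assumption: a Gumbel-softmax relaxation stands in for the
    "stochastic units" mentioned in the abstract.
    """
    noisy = []
    for logit in logits:
        # Clamp u away from 0 and 1 so both logarithms stay finite.
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        # Gumbel(0, 1) noise is -log(-log(u)) for u ~ Uniform(0, 1).
        noisy.append((logit - math.log(-math.log(u))) / temperature)
    # Softmax over the perturbed logits, shifted by the max for stability.
    peak = max(noisy)
    exps = [math.exp(v - peak) for v in noisy]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(features, attention):
    """Aggregate per-cell feature vectors, weighted by the attention map."""
    dim = len(features[0])
    pooled = [0.0] * dim
    for weight, vector in zip(attention, features):
        for i in range(dim):
            pooled[i] += weight * vector[i]
    return pooled


rng = random.Random(0)  # fixed seed so the sketch is repeatable
gaze_logits = [0.1, 2.0, -1.0, 0.5]  # hypothetical scores for 4 spatial cells
cell_features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]  # hypothetical
attention = sample_attention_map(gaze_logits, rng, temperature=0.5)
pooled = attention_pool(cell_features, attention)
```

In this sketch the sampled map is a probability distribution over cells, so the pooled vector is a convex combination of the per-cell features. In a model of the kind the abstract describes, that pooled vector would presumably feed an action classifier, which is what couples the gaze estimate to the action prediction.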

Language: English
Identifiers: ISBN: 9783030012274, 3030012271
ISSN: 0302-9743
eISSN: 1611-3349
DOI: 10.1007/978-3-030-01228-1_38
Title ID: cdi_springer_books_10_1007_978_3_030_01228_1_38
