Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 20 von 1019
Pattern recognition, 2024-11, Vol.155, p.110704, Article 110704
2024

Details

Autor(en) / Beteiligte
Titel
Discriminative action tubelet detector for weakly-supervised action detection
Ist Teil von
  • Pattern recognition, 2024-11, Vol.155, p.110704, Article 110704
Ort / Verlag
Elsevier Ltd
Erscheinungsjahr
2024
Link zum Volltext
Quelle
Alma/SFX Local Collection
Beschreibungen/Notizen
  • We propose a novel framework for spatiotemporal action detection using only video-level class labels as weak supervision. Traditional fully-supervised approaches rely on labor-intensive manual annotation of bounding boxes for each frame. In contrast, collecting video-level class labels is significantly less tedious and more feasible compared to annotating frame-level sequences with bounding boxes. To address this challenge, we propose a discriminative action tubelet detector, called DAT-detector, designed to discern discriminative tubelets from action tubelet proposals (ATPs). Whereas the previous approaches have only focused on tubelet selection among the predefined object proposals, our DAT-detector prioritizes the generation of more precise action tubelets using regression and attention modules. Moreover, we introduce an ATP generation method that enhances the quality of tubelet proposals. Our approach achieves state-of-the-art performance on several benchmarks, and also demonstrates competitive performance even with fully-supervised approaches. •We propose a DAT-detector for spatiotemporal action detection using video-level class labels as weak supervision.•The DAT-detector generates precise actions tubes via proposed attention and regression modules.•We enhace tubelet proposal quality with our action tubelet proposal generation method.•Our method significantly outperforms the state-of-the-art action proposal methods.•We achieve remarkable performance in spatiotemporal action detection across multiple benchmarks, effectively competing with fully supervised approaches.
Sprache
Englisch
Identifikatoren
ISSN: 0031-3203
eISSN: 1873-5142
DOI: 10.1016/j.patcog.2024.110704
Titel-ID: cdi_crossref_primary_10_1016_j_patcog_2024_110704

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX