Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
DFAformer: A Dual Filtering Auxiliary Transformer for Efficient Online Action Detection in Streaming Videos
Ist Teil von
Pattern Recognition and Computer Vision, p.134-145
Ort / Verlag
Singapore: Springer Nature Singapore
Link zum Volltext
Quelle
Alma/SFX Local Collection
Beschreibungen/Notizen
Online action detection (OAD) aims to identify the specific type of ongoing action frame by frame without future information. The full exploration of historical memory with limited yet redundant information constraints for potential patterns thus becomes an important yet challenging problem. We propose a novel transformer-based framework called Dual Filtering Auxiliary Transformer (DFAformer) to achieve this goal. In DFAformer, a two-stage filtering mechanism filters impurities related to background and uninterested actions in the historical memory at the frame and element levels. To make the model concentrate on the ongoing action, we elaborate an auxiliary task, Jaccard Summary Unit, explicitly correlates the past with the future. This auxiliary task guide the learning of model weights without extra computational costs during inference. Experiments on three real-world benchmark datasets demonstrate the superiority of the proposed method.