Scientific conference in universities or research centers
Deep Set Conditioned Latent Representations for Action Recognition
Singh, Akash; de Schepper, Tom; Mets, Kevin et al.
2022
Files


Full Text
SinghAl-deepSetConditionedRepresentations_VISAPP_2022.pdf
Author postprint (472.72 kB)

All documents in ORBi are protected by a user license.

Details



Keywords :
Action recognition; Deep set; Deep learning; Computer vision
Abstract :
[en] In recent years, multi-label, multi-class video action recognition has gained significant popularity. While reasoning over temporally connected atomic actions is mundane for intelligent species, standard artificial neural networks (ANNs) still struggle to classify them. In the real world, atomic actions often temporally connect to form more complex composite actions. The challenge lies in recognising composite actions of varying durations while other distinct composite or atomic actions occur in the background. Drawing upon the success of relational networks, we propose methods that learn to reason over the semantic concept of objects and actions. We empirically show how ANNs benefit from pretraining, relational inductive biases and unordered set-based latent representations. In this paper, we propose deep set conditioned I3D (SCI3D), a two-stream relational network that employs a latent representation of state and a visual representation for reasoning over events and actions. These methods learn to reason about temporally connected actions in order to identify all of them in the video. The proposed method achieves an improvement of around 1.49% mAP in atomic action recognition and 17.57% mAP in composite action recognition over an I3D-NL baseline on the CATER dataset.
Disciplines :
Computer science
Author, co-author :
Singh, Akash; Université de Liège - ULiège > HEC Liège : UER > UER Opérations : Systèmes d'information de gestion ; IDLab, Department of Computer Science, University of Antwerp - imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium
de Schepper, Tom; IDLab, Department of Computer Science, University of Antwerp - imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium
Mets, Kevin; IDLab, Department of Computer Science, University of Antwerp - imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium
Hellinckx, Peter; IDLab, Faculty of Applied Engineering, University of Antwerp - imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium
Oramas, José; IDLab, Department of Computer Science, University of Antwerp - imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium
Latré, Steven; IDLab, Department of Computer Science, University of Antwerp - imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium
Language :
English
Title :
Deep Set Conditioned Latent Representations for Action Recognition
Publication date :
2022
Event name :
VISAPP
Event date :
6 to 8 February 2022
Audience :
International
Name of the research project :
Flanders AI Research Program
Funders :
Flemish Government
Available on ORBi :
since 10 May 2022
