Survey of action detection and localization in space and time using deep learning

morrislee
Aug 4, 2023
1 min read

A Survey on Deep Learning-based Spatio-temporal Action Detection

arXiv paper abstract https://arxiv.org/abs/2308.01618

arXiv PDF paper https://arxiv.org/pdf/2308.01618.pdf

Spatio-temporal action detection (STAD) aims to classify the actions present in a video and localize them in space and time.

It has become a particularly active area of research in computer vision because of its explosively emerging real-world applications, such as autonomous driving, visual surveillance, entertainment, etc.

... This paper provides a comprehensive review of the state-of-the-art deep learning-based methods for STAD.

Firstly, a taxonomy is developed to organize these methods.

Next, the linking algorithms, which aim to associate the frame- or clip-level detection results together to form action tubes, are reviewed.

Then, the commonly used benchmark datasets and evaluation metrics are introduced, and the performance of state-of-the-art models is compared ...

Please like and share this post if you enjoyed it using the buttons at the bottom!

Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact

Web site with my other posts by category https://morrislee1234.wixsite.com/website

LinkedIn https://www.linkedin.com/in/morris-lee-47877b7b

#ComputerVision #ActionRecognition #AINewsClips #AI #ML #ArtificialIntelligence #MachineLearning

News to help your R&D in artificial intelligence, machine learning, robotics, computer vision, smart hardware

Survey of action detection and localization in space and time using deep learning

Recent Posts

Comments