Action recognition by turning videos into an image mosaic

morrislee
Jul 1, 2021

An Image Classifier Can Suffice Video Understanding

arXiv paper abstract https://arxiv.org/abs/2106.14104

arXiv PDF paper https://arxiv.org/pdf/2106.14104.pdf

... casting the video recognition problem as an image recognition task.

... composes input frames into a super image to train an image classifier to fulfill the task of action recognition, in exactly the same way as classifying an image.
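To make the "super image" idea concrete, here is a minimal sketch of tiling sampled frames into one grid image that any image classifier could take as input. The function name `make_super_image`, the near-square row-by-row layout, and the zero-padding of empty cells are my own assumptions for illustration; the paper's exact layout and preprocessing may differ.

```python
import math
import numpy as np

def make_super_image(frames: np.ndarray) -> np.ndarray:
    """Tile T frames of shape (T, H, W, C) into one (rows*H, cols*W, C) image."""
    t, h, w, c = frames.shape
    # Assumption: a near-square grid, filled row by row in temporal order.
    cols = math.ceil(math.sqrt(t))
    rows = math.ceil(t / cols)
    # Assumption: pad with blank (zero) frames when T does not fill the grid.
    pad = rows * cols - t
    if pad:
        blank = np.zeros((pad, h, w, c), dtype=frames.dtype)
        frames = np.concatenate([frames, blank], axis=0)
    grid = frames.reshape(rows, cols, h, w, c)
    # Reorder to (rows, H, cols, W, C), then merge grid axes into pixel axes.
    return grid.transpose(0, 2, 1, 3, 4).reshape(rows * h, cols * w, c)

# Usage: 8 sampled RGB frames of 224x224 become one 3x3 mosaic (last cell blank).
video = np.random.randint(0, 256, size=(8, 224, 224, 3), dtype=np.uint8)
super_image = make_super_image(video)
print(super_image.shape)  # (672, 672, 3)
```

The resulting array can then be resized and passed to a standard image classifier, which is the sense in which the video task is treated like image classification.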

... demonstrating strong and promising performance on four public datasets including Kinetics400, Something-to-something (V2), MiT and Jester, using a recently developed vision transformer.

... also experiment with the prevalent ResNet image classifiers

... results on Kinetics400 are comparable to some of the best-performed CNN approaches based on spatio-temporal modeling. ...

If you enjoyed this post, please like and share it using the buttons at the bottom!

Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact

Web site with my other posts by category https://morrislee1234.wixsite.com/website

#ComputerVision #Video #AINewsClips #AI #ML #ArtificialIntelligence #MachineLearning

