Survey of vision transformers for action recognition
Vision Transformers for Action Recognition: A Survey
arXiv paper abstract https://arxiv.org/abs/2209.05700v1
arXiv PDF paper https://arxiv.org/pdf/2209.05700v1.pdf
... provides the first comprehensive survey of vision transformer techniques for action recognition.
... analyze and summarize the existing and emerging literature in this direction while highlighting the popular trends in adapting transformers for action recognition.
... literature review provides suitable taxonomies for action transformers based on their architecture, modality, and intended objective.
... explore the techniques to encode spatio-temporal data, dimensionality reduction, frame patch and spatio-temporal cube construction, and various representation methods.
... investigate the optimization of spatio-temporal attention in transformer layers to handle longer sequences, typically by reducing the number of tokens in a single attention operation.
... investigate different network learning strategies, such as self-supervised and zero-shot learning, along with their associated losses for transformer-based action recognition ...
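Two of the points above, spatio-temporal cube construction and cutting the number of tokens in a single attention operation, can be made concrete with a small sketch. This is only an illustration under stated assumptions, not the survey's own code: the function names video_to_cubes and attention_pairs are hypothetical, the cube size (2 frames by 16 x 16 pixels) and the 16-frame 224 x 224 clip are arbitrary choices, and factorized space-then-time attention is used as one familiar example of token reduction among the many the survey covers.

```python
import numpy as np


def video_to_cubes(video, t, p):
    """Split a video of shape (T, H, W, C) into non-overlapping cubes of
    t frames by p x p pixels and flatten each cube into one token vector.

    Hypothetical helper for illustration; returns the token matrix and
    the token grid (nt, nh, nw).
    """
    T, H, W, C = video.shape
    assert T % t == 0 and H % p == 0 and W % p == 0, "sizes must divide evenly"
    nt, nh, nw = T // t, H // p, W // p
    # Expose the cube grid as separate axes: (nt, t, nh, p, nw, p, C).
    x = video.reshape(nt, t, nh, p, nw, p, C)
    # Bring the grid axes to the front: (nt, nh, nw, t, p, p, C).
    x = x.transpose(0, 2, 4, 1, 3, 5, 6)
    # One row per cube, one column per raw value inside the cube.
    return x.reshape(nt * nh * nw, t * p * p * C), (nt, nh, nw)


def attention_pairs(nt, nh, nw):
    """Count query-key pairs for joint attention over all tokens versus
    attention factorized into a spatial pass and a temporal pass."""
    n_space = nh * nw
    joint = (nt * n_space) ** 2
    # Spatial pass: each of nt time steps attends over n_space tokens.
    # Temporal pass: each of n_space locations attends over nt tokens.
    factorized = nt * n_space ** 2 + n_space * nt ** 2
    return joint, factorized


# Assumed example clip: 16 RGB frames of 224 x 224 pixels.
video = np.random.rand(16, 224, 224, 3).astype(np.float32)
tokens, grid = video_to_cubes(video, t=2, p=16)
joint, factorized = attention_pairs(*grid)
print(tokens.shape, grid)
print(joint, factorized)
```

With these assumed sizes the clip becomes 1,568 tokens of 1,536 raw values each. Joint attention then scores 2,458,624 query-key pairs per head, while the factorized variant scores 319,872, which is the kind of saving that makes longer sequences tractable. A real model would also pass each flattened cube through a learned linear projection and add positional information, both omitted here.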