Vision transformer morphed to CNN works better

morrislee
May 28, 2021
1 min read

Visformer: The Vision-friendly Transformer

arXiv paper PDF https://arxiv.org/abs/2104.12533

arXiv PDF paper https://arxiv.org/pdf/2104.12533.pdf

GitHub https://github.com/danczs/Visformer

... rapid development ... Transformer module to vision ...

... there are still growing number of evidences showing that these models suffer over-fitting especially when the training data is limited.

... gradually transit a Transformer-based model to a convolution-based model.

... With the same computational complexity, Visformer outperforms both the Transformer-based and convolution-based models in terms of ImageNet classification accuracy, and the advantage becomes more significant when the model complexity is lower or the training set is smaller. ...

Please like and share this post if you enjoyed it using the buttons at the bottom!

Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact

Web site with my other posts by category https://morrislee1234.wixsite.com/website

#ComputerVision #Transformers #AINewsClips #AI #ML #ArtificialIntelligence #MachineLearning

News to help your R&D in artificial intelligence, machine learning, robotics, computer vision, smart hardware

Vision transformer morphed to CNN works better

Recent Posts

Comments