Search


Vision Mamba more accurate than transformers and is 2.8x faster and uses 86.8% less GPU memory
Vision Mamba more accurate than transformers and is 2.8x faster and uses 86.8% less GPU memory Vision Mamba: Efficient Visual...

morrislee
Feb 1, 20241 min read


Survey of image classification with vision transformers
Survey of image classification with vision transformers A Comprehensive Study of Vision Transformers in Image Classification Tasks arXiv...

morrislee
Dec 6, 20231 min read


Survey of vision transformer efficiency
Survey of vision transformer efficiency Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers arXiv...

morrislee
Aug 21, 20231 min read


Survey of transformer inference optimization techniques
Survey of transformer inference optimization techniques A Survey of Techniques for Optimizing Transformer Inference arXiv paper abstract...

morrislee
Jul 18, 20231 min read


Survey of transformers for vision language
Survey of transformers for vision language Vision Language Transformers: A Survey arXiv paper abstract https://arxiv.org/abs/2307.03254...

morrislee
Jul 10, 20231 min read


Survey of transformers for 2D object detection
Survey of transformers for 2D object detection 2D Object Detection with Transformers: A Review arXiv paper abstract...

morrislee
Jun 9, 20231 min read


Survey of vision transformers and hybrid CNN-transformer variants
Survey of vision transformers and hybrid CNN-transformer variants A survey of the Vision Transformers and its CNN-Transformer based...

morrislee
May 18, 20231 min read


1.5x faster vision transformers by using activation sparsity with SparseViT
1.5x faster vision transformers by using activation sparsity with SparseViT SparseViT: Revisiting Activation Sparsity for Efficient...

morrislee
Mar 31, 20231 min read


Vision transformer beats CNN on mobile devices for accuracy and speed with ElasticViT
Vision transformer beats CNN on mobile devices for accuracy and speed with ElasticViT ElasticViT: Conflict-aware Supernet Training for...

morrislee
Mar 20, 20231 min read


Survey of computer vision using transformers by Jamil
Survey of computer vision using transformers by Jamil A Comprehensive Survey of Transformers for Computer Vision arXiv paper abstract...

morrislee
Nov 14, 20221 min read


Survey of transformers for video
Survey of transformers for video Video Transformers: A Survey arXiv paper abstract https://arxiv.org/abs/2201.05991v1 arXiv PDF paper...

morrislee
Jan 20, 20221 min read


Survey of computer vision using transformers
Survey of computer vision using transformers A Survey of Visual Transformers arXiv paper abstract https://arxiv.org/abs/2111.06091 arXiv...

morrislee
Nov 12, 20211 min read


Improve vision transformer by using anti-aliasing
Improve vision transformer by using anti-aliasing Blending Anti-Aliasing into Vision Transformer arXiv paper abstract...

morrislee
Nov 2, 20211 min read


MobileViT: an accurate, light-weight, mobile-friendly vision transformer
MobileViT: an accurate, light-weight, mobile-friendly vision transformer MobileViT: Light-weight, General-purpose, and Mobile-friendly...

morrislee
Oct 6, 20211 min read


Transformer handles images, video, point clouds, and audio to understand world
Transformer handles images, video, point clouds, and audio to understand world Perceiver: General Perception with Iterative Attention...

morrislee
Jul 7, 20211 min read


Training with modified data like using 10 times more data
Training with modified data like using 10 times more data How to train your ViT? Data, Augmentation, and Regularization in Vision...

morrislee
Jun 21, 20211 min read


Get 3D pose and shape of people from monocular images
Get 3D pose and shape of people from monocular images THUNDR: Transformer-based 3D HUmaN Reconstruction with Markers arXiv paper abtract...

morrislee
Jun 18, 20211 min read


Transformer for super-resolution video
Transformer for super-resolution video Video Super-Resolution Transformer arXiv paper abstract https://arxiv.org/abs/2106.06847 arXiv PDF...

morrislee
Jun 15, 20211 min read


Survey of transformers for vision, text, and audio
Survey of transformers for vision, text, and audio A Survey of Transformers arXiv paper abstract https://arxiv.org/abs/2106.04554 arXiv...

morrislee
Jun 9, 20211 min read


Advantages of nested transformers for computer vision
Advantages of nested transformers for computer vision Aggregating Nested Transformers arXiv paper abstract https://arxiv.org/abs/2105.127...

morrislee
Jun 3, 20211 min read