Vision Mamba is more accurate than transformers, 2.8x faster, and uses 86.8% less GPU memory
Vision Mamba: Efficient Visual...
morrislee
Feb 1, 2024 · 1 min read
78 views
0 comments

Survey of image classification with vision transformers
A Comprehensive Study of Vision Transformers in Image Classification Tasks arXiv...
morrislee
Dec 6, 2023 · 1 min read
76 views
0 comments


Survey of vision transformer efficiency
Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers arXiv...
morrislee
Aug 21, 2023 · 1 min read
122 views
0 comments

Survey of transformer inference optimization techniques
A Survey of Techniques for Optimizing Transformer Inference arXiv paper abstract...
morrislee
Jul 18, 2023 · 1 min read
63 views
0 comments


Survey of transformers for vision language
Vision Language Transformers: A Survey arXiv paper abstract https://arxiv.org/abs/2307.03254...
morrislee
Jul 10, 2023 · 1 min read
188 views
0 comments

Survey of transformers for 2D object detection
2D Object Detection with Transformers: A Review arXiv paper abstract...
morrislee
Jun 9, 2023 · 1 min read
83 views
0 comments


Survey of vision transformers and hybrid CNN-transformer variants
A survey of the Vision Transformers and its CNN-Transformer based...
morrislee
May 18, 2023 · 1 min read
229 views
0 comments


1.5x faster vision transformers by using activation sparsity with SparseViT
SparseViT: Revisiting Activation Sparsity for Efficient...
morrislee
Mar 31, 2023 · 1 min read
89 views
0 comments


Vision transformer beats CNN on mobile devices for accuracy and speed with ElasticViT
ElasticViT: Conflict-aware Supernet Training for...
morrislee
Mar 20, 2023 · 1 min read
188 views
0 comments

Survey of computer vision using transformers by Jamil
A Comprehensive Survey of Transformers for Computer Vision arXiv paper abstract...
morrislee
Nov 14, 2022 · 1 min read
163 views
0 comments


Survey of transformers for video
Video Transformers: A Survey arXiv paper abstract https://arxiv.org/abs/2201.05991v1 arXiv PDF paper...
morrislee
Jan 20, 2022 · 1 min read
98 views
0 comments


Survey of computer vision using transformers
A Survey of Visual Transformers arXiv paper abstract https://arxiv.org/abs/2111.06091 arXiv...
morrislee
Nov 12, 2021 · 1 min read
275 views
0 comments


Improve vision transformer by using anti-aliasing
Blending Anti-Aliasing into Vision Transformer arXiv paper abstract...
morrislee
Nov 2, 2021 · 1 min read
289 views
0 comments

MobileViT: an accurate, light-weight, mobile-friendly vision transformer
MobileViT: Light-weight, General-purpose, and Mobile-friendly...
morrislee
Oct 6, 2021 · 1 min read
49 views
0 comments


Transformer handles images, video, point clouds, and audio to understand world
Perceiver: General Perception with Iterative Attention...
morrislee
Jul 7, 2021 · 1 min read
51 views
0 comments

Training with modified data is like using 10 times more data
How to train your ViT? Data, Augmentation, and Regularization in Vision...
morrislee
Jun 21, 2021 · 1 min read
19 views
0 comments

Get 3D pose and shape of people from monocular images
THUNDR: Transformer-based 3D HUmaN Reconstruction with Markers arXiv paper abstract...
morrislee
Jun 18, 2021 · 1 min read
115 views
0 comments

Transformer for super-resolution video
Video Super-Resolution Transformer arXiv paper abstract https://arxiv.org/abs/2106.06847 arXiv PDF...
morrislee
Jun 15, 2021 · 1 min read
82 views
0 comments

Survey of transformers for vision, text, and audio
A Survey of Transformers arXiv paper abstract https://arxiv.org/abs/2106.04554 arXiv...
morrislee
Jun 9, 2021 · 1 min read
124 views
0 comments

Advantages of nested transformers for computer vision
Aggregating Nested Transformers arXiv paper abstract https://arxiv.org/abs/2105.127...
morrislee
Jun 3, 2021 · 1 min read
38 views
0 comments