morrislee
Jun 9, 2021 · 1 min read
Survey of transformers for vision, text, and audio
A Survey of Transformers arXiv paper abstract https://arxiv.org/abs/2106.04554 arXiv...
124 views, 0 comments
morrislee
Jun 8, 2021 · 1 min read
Survey of deep learning for image retrieval
A Decade Survey of Content Based Image Retrieval using Deep Learning arXiv paper abstract...
270 views, 0 comments
morrislee
Jun 4, 2021 · 1 min read
Unblurring defocused images using multi-branch neural networks
BaMBNet: A Blur-aware Multi-branch Network for Defocus Deblurring arXiv...
176 views, 0 comments
morrislee
Jun 3, 2021 · 1 min read
Advantages of nested transformers for computer vision
Aggregating Nested Transformers arXiv paper abstract https://arxiv.org/abs/2105.127...
38 views, 0 comments
morrislee
Jun 1, 2021 · 1 min read
Neural network for counting crowds
Multi-Level Attentive Convoluntional Neural Network for Crowd Counting arXiv paper abstract...
17 views, 0 comments
morrislee
May 28, 2021 · 1 min read
Vision transformer morphed to CNN works better
Visformer: The Vision-friendly Transformer arXiv paper PDF https://arxiv.org/abs/2104.1253...
18 views, 0 comments
morrislee
May 27, 2021 · 1 min read
Getting better placement of objects in a scene
SBEVNet: End-to-End Deep Stereo Layout Estimation arXiv paper abstract...
23 views, 0 comments
morrislee
May 26, 2021 · 1 min read
Multi-layer perceptrons for vision competitive with transformers and CNNs
MLP is all you need... again? ... MLP-Mixer: An all-MLP...
25 views, 0 comments
morrislee
May 25, 2021 · 1 min read
Image segmentation of camouflaged objects
Anabranch Network for Camouflaged Object Segmentation arXiv paper abstract...
22 views, 0 comments
morrislee
May 24, 2021 · 1 min read
Facebook AI software does speech recognition without any transcribed data
High-performance speech recognition with no supervision at all...
17 views, 0 comments
morrislee
May 21, 2021 · 1 min read
Making street maps from satellite images
Image to Image Translation : Generating maps from satellite images arXiv paper abstract...
26 views, 0 comments
morrislee
May 20, 2021 · 1 min read
Google Vertex AI builds, trains, and deploys scalable machine learning models
Google launches Vertex AI, a fully managed cloud AI service...
20 views, 0 comments
morrislee
May 19, 2021 · 1 min read
Survey of work on remaining challenges in biometrics
Biometrics: Trust, but Verify arXiv paper abstract https://arxiv.org/abs/2105.06625v...
9 views, 0 comments
morrislee
May 18, 2021 · 1 min read
Get the 3D shape of people, animals, and other non-rigid objects from video
3D Reconstruction from Videos: LASR Generate 3D models of humans...
11 views, 0 comments
morrislee
May 17, 2021 · 1 min read
Detect and segment 3D objects in a room after training with only a list of the room's objects
Recognizing 3D spaces without spatial labels Facebook AI...
12 views, 0 comments
morrislee
May 14, 2021 · 1 min read
Automating Data Science: Prospects and Challenges
(accepted by the Communications of the ACM) arXiv paper abstract...
16 views, 0 comments
morrislee
May 13, 2021 · 1 min read
Computer vision transformer models (CLIP, ViT, DeiT) released by Hugging Face
Transformers meets VISION v4.6.0 is the first CV dedicated...
9 views, 0 comments
morrislee
May 12, 2021 · 1 min read
Get image matching text plus image, also get descriptions of images
ALIGN: Scaling Up Visual and Vision-Language Representation Learning...
10 views, 0 comments
morrislee
May 11, 2021 · 1 min read
Results of a competition on enhancing video resolution
NTIRE 2021 Challenge on Video Super-Resolution arXiv paper abstract...
15 views, 0 comments
morrislee
May 9, 2021 · 1 min read
Get the shape of a room and the people in it from echoes using one microphone
"Bat-Sense" Technology for Smartphones Generates Images From Sound...
43 views, 0 comments