Search


Survey of transformers for vision, text, and audio
Survey of transformers for vision, text, and audio A Survey of Transformers arXiv paper abstract https://arxiv.org/abs/2106.04554 arXiv...

morrislee
Jun 9, 20211 min read


Survey of deep learning for image retrieval
Survey of deep learning for image retrieval A Decade Survey of Content Based Image Retrieval using Deep Learning arXiv paper abstract...

morrislee
Jun 8, 20211 min read


Unblurring defocused images using multi-branch neural networks
Unblurring defocused images using multi-branch neural networks BaMBNet: A Blur-aware Multi-branch Network for Defocus Deblurring arXiv...

morrislee
Jun 4, 20211 min read


Advantages of nested transformers for computer vision
Advantages of nested transformers for computer vision Aggregating Nested Transformers arXiv paper abstract https://arxiv.org/abs/2105.127...

morrislee
Jun 3, 20211 min read


Neural network for counting crowds
Neural network for counting crowds Multi-Level Attentive Convoluntional Neural Network for Crowd Counting arXiv paper abstract...

morrislee
Jun 1, 20211 min read


Vision transformer morphed to CNN works better
Vision transformer morphed to CNN works better Visformer: The Vision-friendly Transformer arXiv paper PDF https://arxiv.org/abs/2104.1253...

morrislee
May 28, 20211 min read


Getting better placement of objects in a scene
Getting better placement of objects in a scene SBEVNet: End-to-End Deep Stereo Layout Estimation arXiv paper abstract...

morrislee
May 27, 20211 min read


Multi-layer perceptrons for vision competitive with transformers and CNN
Multi-layer perceptrons for vision competitive with transformers and CNN MLP is all you need... again? ... MLP-Mixer: An all-MLP...

morrislee
May 26, 20211 min read


Image segmentation of camouflaged objects
Image segmentation of camouflaged objects Anabranch Network for Camouflaged Object Segmentation arXiv paper abstract...

morrislee
May 25, 20211 min read


Facebook AI software does speech recognition without any transcribed data
Facebook AI software does speech recognition without any transcribed data High-performance speech recognition with no supervision at all...

morrislee
May 24, 20211 min read


Making street maps from satellite images
Making street maps from satellite images Image to Image Translation : Generating maps from satellite images arXiv paper abstract...

morrislee
May 21, 20211 min read


Google Vertex AI builds, trains, and deploys scalable machine learning models
Google Vertex AI builds, trains, and deploys scalable machine learning models Google launches Vertex AI, a fully managed cloud AI service...

morrislee
May 20, 20211 min read


Survey of work on remaining challenges in biometrics
Survey of work on remaining challenges in biometrics Biometrics: Trust, but Verify arXiv paper abstract https://arxiv.org/abs/2105.06625v...

morrislee
May 19, 20211 min read


From video get 3D shape of people, animals, and other non-rigid objects
From video get 3D shape of people, animals, and other non-rigid objects 3D Reconstruction from Videos: LASR Generate 3D models of humans...

morrislee
May 18, 20211 min read


Detect and segment 3D objects in room after train only with list of room objects
Detect and segment 3D objects in room after train only with list of room objects Recognizing 3D spaces without spatial labels Facebook AI...

morrislee
May 17, 20211 min read


Automating Data Science: Prospects and Challenges
Automating Data Science: Prospects and Challenges (accepted by the Communications of the ACM) arXiv paper abstract...

morrislee
May 14, 20211 min read


Computer vision transformer models (CLIP, ViT, DeiT) released by Hugging Face
Computer vision transformer models (CLIP, ViT, DeiT) released by Hugging Face Transformers meets VISION v4.6.0 is the first CV dedicated...

morrislee
May 13, 20211 min read


Get image matching text plus image, also get descriptions of images
Get image matching text plus image, also get descriptions of images ALIGN: Scaling Up Visual and Vision-Language Representation Learning...

morrislee
May 12, 20211 min read


Results of a competition on enhancing video resolution
Results of a competition on enhancing video resolution NTIRE 2021 Challenge on Video Super-Resolution arXiv paper abstract...

morrislee
May 11, 20211 min read


Get shape of room and people in it by echoes using one mike
Get shape of room and people in it by echoes using one mike "Bat-Sense" Technology for Smartphones Generates Images From Sound...

morrislee
May 9, 20211 min read