Search


Object segmentation using transformers and multilayer perceptron
Object segmentation using transformers and multilayer perceptron SegFormer: Simple and Efficient Design for Semantic Segmentation with...

morrislee
Jun 2, 20211 min read


Neural network for counting crowds
Neural network for counting crowds Multi-Level Attentive Convoluntional Neural Network for Crowd Counting arXiv paper abstract...

morrislee
Jun 1, 20211 min read


Vision transformer morphed to CNN works better
Vision transformer morphed to CNN works better Visformer: The Vision-friendly Transformer arXiv paper PDF https://arxiv.org/abs/2104.1253...

morrislee
May 28, 20211 min read


Getting better placement of objects in a scene
Getting better placement of objects in a scene SBEVNet: End-to-End Deep Stereo Layout Estimation arXiv paper abstract...

morrislee
May 27, 20211 min read


Multi-layer perceptrons for vision competitive with transformers and CNN
Multi-layer perceptrons for vision competitive with transformers and CNN MLP is all you need... again? ... MLP-Mixer: An all-MLP...

morrislee
May 26, 20211 min read


Image segmentation of camouflaged objects
Image segmentation of camouflaged objects Anabranch Network for Camouflaged Object Segmentation arXiv paper abstract...

morrislee
May 25, 20211 min read


Making street maps from satellite images
Making street maps from satellite images Image to Image Translation : Generating maps from satellite images arXiv paper abstract...

morrislee
May 21, 20211 min read


Survey of work on remaining challenges in biometrics
Survey of work on remaining challenges in biometrics Biometrics: Trust, but Verify arXiv paper abstract https://arxiv.org/abs/2105.06625v...

morrislee
May 19, 20211 min read


From video get 3D shape of people, animals, and other non-rigid objects
From video get 3D shape of people, animals, and other non-rigid objects 3D Reconstruction from Videos: LASR Generate 3D models of humans...

morrislee
May 18, 20211 min read


Detect and segment 3D objects in room after train only with list of room objects
Detect and segment 3D objects in room after train only with list of room objects Recognizing 3D spaces without spatial labels Facebook AI...

morrislee
May 17, 20211 min read


Computer vision transformer models (CLIP, ViT, DeiT) released by Hugging Face
Computer vision transformer models (CLIP, ViT, DeiT) released by Hugging Face Transformers meets VISION v4.6.0 is the first CV dedicated...

morrislee
May 13, 20211 min read


Get image matching text plus image, also get descriptions of images
Get image matching text plus image, also get descriptions of images ALIGN: Scaling Up Visual and Vision-Language Representation Learning...

morrislee
May 12, 20211 min read


Results of a competition on enhancing video resolution
Results of a competition on enhancing video resolution NTIRE 2021 Challenge on Video Super-Resolution arXiv paper abstract...

morrislee
May 11, 20211 min read


Get shape of room and people in it by echoes using one mike
Get shape of room and people in it by echoes using one mike "Bat-Sense" Technology for Smartphones Generates Images From Sound...

morrislee
May 9, 20211 min read


Get 3D shapes from a single color image
Get 3D shapes from a single color image Cuboids Revisited: Learning Robust 3D Shape Fitting to Single RGB Images arXiv paper abstract...

morrislee
May 7, 20211 min read


Neural net explains its prediction with images and audio
Neural net explains its prediction with images and audio Where and When: Space-Time Attention for Audio-Visual Explanations arXiv paper...

morrislee
May 6, 20211 min read


Better low-resolution image classification using attributes
Better low-resolution image classification using attributes Enhancing Fine-Grained Classification for Low Resolution Images DeepAI Web...

morrislee
May 5, 20211 min read


Facebook segments objects in image or video with no supervision
Facebook segments objects in image or video with no supervision Advancing the state of the art in computer vision with self-supervised...

morrislee
May 3, 20211 min read


Removing haze in images
Removing haze in images Contrastive Learning for Compact Single Image Dehazing arXiv paper abstract https://arxiv.org/abs/2104.09367v1...

morrislee
Apr 30, 20211 min read


Get summary video from a text search
Get summary video from a text search GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization arXiv paper...

morrislee
Apr 28, 20211 min read