Small object detection using sparsely-connected convolution for less compute with TinyDet
TinyDet: Accurate Small Object Detection in...

morrislee
Apr 10, 2023 · 1 min read


Learn new objects without forgetting old ones with transformers by using reliability with CL-DETR
Continual Detection Transformer for...

morrislee
Apr 7, 2023 · 1 min read


Real-time segment all objects in image without training on CPU with Segment Anything
Segment Anything Project page...

morrislee
Apr 6, 2023 · 1 min read


Get 3D head that is photorealistic and animatable from 1 minute of monocular video with MonoAvatar
Learning Personalized High Quality...

morrislee
Apr 5, 2023 · 1 min read


Survey of vision-language models for vision tasks
Vision-Language Models for Vision Tasks: A Survey arXiv paper abstract...

morrislee
Apr 4, 2023 · 1 min read


Get 3D shape even with changing light using view-dependence normalization with VDN-NeRF
VDN-NeRF: Resolving Shape-Radiance Ambiguity via...

morrislee
Apr 3, 2023 · 1 min read


1.5x faster vision transformers by using activation sparsity with SparseViT
SparseViT: Revisiting Activation Sparsity for Efficient...

morrislee
Mar 31, 2023 · 1 min read


Get object pose in new domain without labels by automatically fine-tuning on new images with TTA-COPE
TTA-COPE: Test-Time Adaptation for...

morrislee
Mar 30, 2023 · 1 min read


Segment objects in videos using only bounding boxes along with time consistency with MaskFreeVIS
Mask-Free Video Instance Segmentation...

morrislee
Mar 29, 2023 · 1 min read


Get high-quality 3D shape with realistic texture from LiDAR smartphone by neural geometry with TMO
TMO: Textured Mesh Acquisition of...

morrislee
Mar 28, 2023 · 1 min read


Real-time shape of moving 3D object and 6-DoF tracking using neural object field with BundleSDF
BundleSDF: Neural 6-DoF Tracking and 3D...

morrislee
Mar 27, 2023 · 1 min read


Complete point clouds using unsupervised learning by the Sinkhorn algorithm with UDPReg
Unsupervised Deep Probabilistic Approach for...

morrislee
Mar 24, 2023 · 1 min read


Segment objects in videos using only 2 labeled frames with Two-shot-Video-Object-Segmentation
Two-shot Video Object Segmentation arXiv...

morrislee
Mar 23, 2023 · 1 min read


Detect all known and unknown objects using many sources and probability calibration with UniDetector
Detecting Everything in the Open...

morrislee
Mar 22, 2023 · 1 min read


Get 3D shape of novel object from one image using pre-trained diffusion models with Zero-1-to-3
Zero-1-to-3: Zero-shot One Image to 3D...

morrislee
Mar 21, 2023 · 1 min read


Vision transformer beats CNN on mobile devices for accuracy and speed with ElasticViT
ElasticViT: Conflict-aware Supernet Training for...

morrislee
Mar 20, 2023 · 1 min read


Get 3D surface from Neural Radiance Field using signed surface approximation with NeRFMeshing
NeRFMeshing: Distilling Neural Radiance...

morrislee
Mar 17, 2023 · 1 min read


Survey of visual recognition with deep learning on new domains using only a few examples
Deep Learning for Cross-Domain Few-Shot Visual...

morrislee
Mar 16, 2023 · 1 min read


Get absolute depth in monocular self-supervised depth estimation by global scale factor with Dana
One scalar is all you need -- absolute...

morrislee
Mar 15, 2023 · 1 min read


3D object detection boxes directly from image and point data using multi-modal features with CMT
Cross Modal Transformer: Towards Fast...

morrislee
Mar 14, 2023 · 1 min read




