Search


Answer question about an image using structured information graph with SA-VQA
Answer question about an image using structured information graph with SA-VQA SA-VQA: Structured Alignment of Visual and Semantic...

morrislee
Jan 28, 20221 min read


Detect people in top-view, fish-eye images with ARPD
Detect people in top-view, fish-eye images with ARPD ARPD: Anchor-free Rotation-aware People Detection using Topview Fisheye Camera arXiv...

morrislee
Jan 27, 20221 min read


Count objects of any type in an image with only a few examples
Count objects of any type in an image with only a few examples Iterative Correlation-based Feature Refinement for Few-shot Counting arXiv...

morrislee
Jan 26, 20221 min read


Put partial 3D point clouds into standard orientation with self-supervised ConDor
Put partial 3D point clouds into standard orientation with self-supervised ConDor ConDor: Self-Supervised Canonicalization of 3D Pose for...

morrislee
Jan 25, 20221 min read


Real-time 3D object detection on low CPU headsets
Real-time 3D object detection on low CPU headsets Realtime 3D Object Detection for Headsets arXiv paper abstract...

morrislee
Jan 24, 20221 min read


Survey of methods to determine camera position
Survey of methods to determine camera position A Critical Analysis of Image-based Camera Pose Estimation Techniques arXiv paper abstract...

morrislee
Jan 21, 20221 min read


Survey of transformers for video
Survey of transformers for video Video Transformers: A Survey arXiv paper abstract https://arxiv.org/abs/2201.05991v1 arXiv PDF paper...

morrislee
Jan 20, 20221 min read


Improve pedestrian detection by using general object detector with Cascade RCNN
Improve pedestrian detection by using general object detector with Cascade RCNN Pedestrian Detection: Domain Generalization, CNNs,...

morrislee
Jan 19, 20221 min read


Enhance dim images better and simpler using imperfectly aligned images with CIDN
Enhance dim images better and simpler using imperfectly aligned images with CIDN Enhancing Low-Light Images in Real World via Cross-Image...

morrislee
Jan 18, 20221 min read


Better depth and motion from thermal images by improving self-supervised learning
Better depth and motion from thermal images by improving self-supervised learning Maximizing Self-supervision from Thermal Image for...

morrislee
Jan 17, 20221 min read


Identify better the events and participants in an image with CLIP-Event
Identify better the events and participants in an image with CLIP-Event CLIP-Event: Connecting Text and Images with Event Structures...

morrislee
Jan 14, 20221 min read


Detect known and also unknown objects which are later labeled and added without forgetting
Detect known and also unknown objects which are later labeled and added without forgetting Revisiting Open World Object Detection arXiv...

morrislee
Jan 13, 20221 min read


Get eye gaze direction using low-cost camera with edge device
Get eye gaze direction using low-cost camera with edge device Resolving Camera Position for a Practical Application of Gaze Estimation on...

morrislee
Jan 12, 20221 min read


Crop an image based on user text description
Crop an image based on user text description Repurposing Existing Deep Networks for Caption and Aesthetic-Guided Image Cropping arXiv...

morrislee
Jan 11, 20221 min read


Train object detector on image data without box annotation with Detic
Train object detector on image data without box annotation with Detic Detecting Twenty-thousand Classes using Image-level Supervision...

morrislee
Jan 10, 20221 min read


Recognizing the place shown in an image of a scene despite distracting factors with TransVPR
Recognizing the place shown in an image of a scene despite distracting factors with TransVPR TransVPR: Transformer-based place...

morrislee
Jan 7, 20221 min read


Improved segmenting of objects in a video that are mentioned in a text query with ReferFormer
Improved segmenting of objects in a video that are mentioned in a text query with ReferFormer Language as Queries for Referring Video...

morrislee
Jan 6, 20221 min read


Faster restoration of a single non-uniformly blurred image by using local and global data
Faster restoration of a single non-uniformly blurred image by using local and global data Adaptive Single Image Deblurring arXiv paper...

morrislee
Jan 5, 20221 min read


Survey of vision for locating where an image is captured or where objects are located in an image
Survey of vision for locating where an image is captured or where objects are located in an image Visual and Object Geo-localization: A...

morrislee
Jan 4, 20221 min read


Identify regions in road image mentioned in sentence
Identify regions in road image mentioned in sentence Grounding Linguistic Commands to Navigable Regions arXiv paper abstract...

morrislee
Jan 3, 20221 min read