Correcting an image classifier prediction by using a single image
Better document understanding without OCR using Donut transformer
Get centimeter depth image from smartphone using LiDAR and unsteadiness of hand
Classifying visual and audio events of various durations in videos with MM-Pyramid
Multi-label image classification using information on context, space, and meaning
Survey of panoptic image segmentation for objects and regions
Many types of computer vision tasks possible with new customizable vision foundation model, Florence
Correcting Face Distortion in Wide-Angle Videos
Train new object detector without bounding box annotations using captioned images
Reidentify people in new scenes better by using multiple networks
Faster multi-person pose estimation by using object modeling
Better training data for driving by using simulator to guide realistic image synthesis
Survey of fine-grained image analysis using deep learning
Survey of computer vision using transformers
Match 3D points with 99% outliers quickly with VOCRA
Improved super-resolution for images by using flows
Detect image anomalies by using low-dimensional embeddings of patches
Survey of video anomaly detection using self-supervised deep learning
Get 3D scene geometry and segmentation from a single RGB image
Video segmentation with less carrying over of errors by using a cyclic workflow