Unsupervised multi-label image classification using many image snippet embeddings with Abdelfattah
Detect known and unknown objects with semi-supervised learning by auto-encoder ensemble with OWSSD
Biometric recognition with occlusions by using multi-scale graph with MS-DGR
Semi-supervised segmentation using learning with augmentation in image and feature space with DSSN
Real-time 3D scene reconstruction from monocular images using sparse points from SLAM with SST
Learn new objects without forgetting old ones by only replaying old foreground objects with ABR
Survey of question answering on images with Ma
Object detection with transformer on many domains by attention to past predictions with Cascade-DETR
Segment objects with limited labels by collaboration of output and representation spaces with CSS
Survey of unknown object detection and segmentation
Survey of transformer inference optimization techniques
Super-resolution on a single image by using an adaptive MaxViT transformer with MaxSR
Real-time object detector on the edge using cells of interest instead of pixels with YOLIC
Get 3D object shape from sparse views with noisy poses by consistent surface in views with SC-NeuS
Enhance dark images using a learned color invariant and instance-aware translation with DiCo
Segment objects at many granularities by generating masks at multiple levels with Semantic-SAM
Survey of transformers for vision language
Fix motion blur image by matching reference patch and fusing features using with Zou
Segment untrained objects by grouping visual features and enhancing descriptions with O3S
Get pose of many symmetric objects from images using view fusion and symmetry objective with SyMFM6D