Re-identify people in images using multi-stage transformers with COAT
Train object detector by labeling only one point per object with NSS
Fix incomplete 3D depth data by using RGB image and GAN with RDF-GAN
Scene segmentation without labeling by combining images and LiDAR with Drive&Segment
Real-time HybridNets detects traffic objects, drivable areas, and road lanes
Object detection from one example using scale and features with SaFT
Image segmentation without training labels by clustering features with STEGO
Real-time 3D reconstruction despite occlusion using motion prediction with OcclusionFusion
Faster and more accurate scene segmentation by being aware of the decoder stages with SFANet
Combine knowledge of many vision transformers into a smaller multi-talented model
Improve segmentation in bin picking by using part awareness
Get 3D layout of room from a panoramic image with LGT-Net
Convert a blurry image to a sharp video using a continuous intensity function with E-CIR
Survey of continuous human action recognition
Survey of re-identifying people seen by multiple cameras for various problem types
Real-time object segmentation in video with faster training using EfficientVIS
Better recognition of human actions using two-stage detection transformers
Improve scene segmentation with smaller models by distilling knowledge with SKR+PEA
Estimate the pixel map homography using image features and pixels
Segment objects of any type in images using model trained without manual annotations with FreeSOLO