Segmentation with only a few examples by using image captions instead of pixel labels with IMR-HSNet
Segment unknown objects using top-down learning and bottom-up segmentations with UDOS
Survey of semantic segmentation for autonomous driving including efficiency and use of depth or time
Detect objects in new domain without labels in target by using domain-invariant frequencies with FIT
Count objects in image with zero examples using patches like a generated sample with ZSC
Small object detection using bounding boxes guided by confidences with C-BBL
Object segmentation with only image labels using intermediate patch features with ToCo
Indoor 3D scene reconstruction using plane-regularized signed distance field with P2SDF
Complete point clouds by using missing part sensitive transformer with ProxyFormer
Survey of action labeling over time in video
Complete 3D scene using only images having occlusions by masked autoencoder with VoxFormer
3D semantic segmentation using RGB and depth with PDCNet
Get 3D human in video by self-supervised scene decomposition without prior datasets with Vid2Avatar
Reconstruct 3D object from one image using diffusion image generator with RealFusion
Survey of semi-supervised semantic segmentation
Survey of machine learning on the edge
Monocular depth using transformer, CNN, and uncertainty with URCDC-Depth
Real-time object detector on the edge using better augmentation and FCOS with EdgeYOLO
Survey of real-time object detection networks including versatility, robustness, resources, energy
Survey of semantic image segmentation over two decades