Real-time object segmentation in video with faster training using EfficientVIS

morrislee
Mar 4, 2022
1 min read

Efficient Video Instance Segmentation via Tracklet Query and Proposal

arXiv paper abstract https://arxiv.org/abs/2203.01853

arXiv PDF paper https://arxiv.org/pdf/2203.01853.pdf

Project page https://jialianwu.com/projects/EfficientVIS.html

Video Instance Segmentation (VIS) aims to simultaneously classify, segment, and track multiple object instances in videos.

... recent VIS transformer (VisTR) which performs VIS end-to-end within a clip. ... suffers from long training time due to its frame-wise dense attention.

... proposes EfficientVIS, a fully end-to-end framework with efficient training and inference.

... tracklet query and tracklet proposal that associate and segment regions-of-interest (RoIs) across space and time by an iterative query-video interaction.

... Compared to VisTR, EfficientVIS requires 15x fewer training epochs while achieving state-of-the-art accuracy on the YouTube-VIS benchmark.

... method enables whole video instance segmentation in a single end-to-end pass without data association at all.

Please like and share this post if you enjoyed it using the buttons at the bottom!

Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact

Web site with my other posts by category https://morrislee1234.wixsite.com/website

LinkedIn https://www.linkedin.com/in/morris-lee-47877b7b

#ComputerVision #Segmentation #AINewsClips #AI #ML #ArtificialIntelligence #MachineLearning

News to help your R&D in artificial intelligence, machine learning, robotics, computer vision, smart hardware

Real-time object segmentation in video with faster training using EfficientVIS

Recent Posts

Comments