Segment objects in videos using only bounding boxes by predicting targets in the training set with PM-VIS
PM-VIS: High-Performance Box-Supervised Video Instance Segmentation
arXiv paper abstract https://arxiv.org/abs/2404.13863
arXiv PDF paper https://arxiv.org/pdf/2404.13863.pdf
... Box-supervised Video Instance Segmentation (VIS) methods have emerged as a viable solution to mitigate the labor-intensive annotation process.
... Inspired by ... Segment Anything Model (SAM), ... introduce a novel approach that aims at harnessing instance box annotations from multiple perspectives to generate high-quality instance pseudo masks
... leverage ground-truth boxes to create three types of pseudo masks using the HQ-SAM model, the box-supervised VIS model (IDOL-BoxInst), and the VOS model (DeAOT) separately, along with three corresponding optimization mechanisms.
... introduce two ground-truth data filtering methods, assisted by ... pseudo masks, to ... enhance the training dataset quality and improve the performance of fully supervised VIS
... To ... capitalize on the ... Pseudo Masks, ... introduce a novel algorithm, PM-VIS, to integrate mask losses into IDOL-BoxInst.
... PM-VIS model ... demonstrates strong ability in instance mask prediction, achieving state-of-the-art performance ...
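To make the idea of using ground-truth boxes to judge pseudo masks more concrete, here is a minimal sketch. It is not the paper's optimization mechanism. It assumes a simple stand-in rule: for one instance in one frame, keep the candidate pseudo mask whose tight bounding box has the highest IoU with the ground-truth box. The function names (mask_to_box, box_iou, pick_pseudo_mask) and the candidate labels are hypothetical, chosen only for illustration.

```python
from typing import Dict, List, Optional, Tuple

Mask = List[List[int]]            # binary mask as rows of 0/1 values
Box = Tuple[int, int, int, int]   # (x_min, y_min, x_max, y_max), inclusive pixels


def mask_to_box(mask: Mask) -> Optional[Box]:
    """Return the tight box around the foreground pixels, or None if the mask is empty."""
    xs = [x for row in mask for x, v in enumerate(row) if v]
    ys = [y for y, row in enumerate(mask) if any(row)]
    if not xs:
        return None
    return (min(xs), min(ys), max(xs), max(ys))


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two inclusive pixel boxes."""
    inter_w = min(a[2], b[2]) - max(a[0], b[0]) + 1
    inter_h = min(a[3], b[3]) - max(a[1], b[1]) + 1
    inter = max(inter_w, 0) * max(inter_h, 0)
    area_a = (a[2] - a[0] + 1) * (a[3] - a[1] + 1)
    area_b = (b[2] - b[0] + 1) * (b[3] - b[1] + 1)
    return inter / (area_a + area_b - inter)


def pick_pseudo_mask(candidates: Dict[str, Mask], gt_box: Box) -> Tuple[Optional[str], float]:
    """Pick the candidate whose tight box best matches the ground-truth box.

    This selection rule is an assumption made for this sketch, not the
    procedure described in the paper.
    """
    best_name, best_score = None, 0.0
    for name, mask in candidates.items():
        box = mask_to_box(mask)
        score = box_iou(box, gt_box) if box is not None else 0.0
        if score > best_score:
            best_name, best_score = name, score
    return best_name, best_score


# Toy 4x4 example with three hypothetical candidates for one instance.
tight = [[0, 0, 0, 0], [0, 1, 1, 0], [0, 1, 1, 0], [0, 0, 0, 0]]
loose = [[1, 1, 1, 1] for _ in range(4)]
empty = [[0, 0, 0, 0] for _ in range(4)]
candidates = {"candidate_a": tight, "candidate_b": loose, "candidate_c": empty}
best = pick_pseudo_mask(candidates, gt_box=(1, 1, 2, 2))
```

In the toy example, the mask that fits the ground-truth box exactly is selected, the over-segmented mask scores lower, and the empty mask scores zero. A real pipeline would work on per-frame mask arrays and would likely combine several cues rather than a single box IoU.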
Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact
Web site with my other posts by category https://morrislee1234.wixsite.com/website