Count objects in image with point prompts from object localization and CLIP to identify with PseCo

morrislee
Nov 22, 2023
1 min read

Point, Segment and Count: A Generalized Framework for Object Counting

arXiv paper abstract https://arxiv.org/abs/2311.12386

arXiv PDF paper https://arxiv.org/pdf/2311.12386.pdf

Class-agnostic object counting aims to count all objects in an image with respect to example boxes or class names, a.k.a few-shot and zero-shot counting. Current state-of-the-art methods highly rely on density maps to predict object counts, which lacks model interpretability.

... propose a generalized framework for both few-shot and zero-shot object counting based on detection.

... framework combines the superior advantages of two foundation models without compromising their zero-shot capability: (i) SAM to segment all possible objects as mask proposals, and (ii) CLIP to classify proposals to obtain accurate object counts.

... framework, termed PseCo, follows three steps: point, segment, and count ... propose a class-agnostic object localization to provide but least point prompts for SAM, which ... reduces computation ... avoids missing small objects.

... propose ... object classification that leverages CLIP image/text embeddings as the classifier, following a hierarchical knowledge distillation to obtain discriminative classifications among ... mask proposals.

... demonstrate that PseCo achieves state-of-the-art performance in both few-shot/zero-shot object counting/detection ...

Please like and share this post if you enjoyed it using the buttons at the bottom!

Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact

Web site with my other posts by category https://morrislee1234.wixsite.com/website

LinkedIn https://www.linkedin.com/in/morris-lee-47877b7b

#ComputerVision #ObjectDetection #AINewsClips #AI #ML #ArtificialIntelligence #MachineLearning

News to help your R&D in artificial intelligence, machine learning, robotics, computer vision, smart hardware

Count objects in image with point prompts from object localization and CLIP to identify with PseCo

Recent Posts

Comments