Re-identify people in images using multi-stage transformers with COAT

morrislee
Mar 25, 2022
1 min read

Cascade Transformers for End-to-End Person Search

arXiv paper abstract https://arxiv.org/abs/2203.09642v1

arXiv PDF paper https://arxiv.org/pdf/2203.09642v1.pdf

The goal of person search is to localize a target person from a gallery set of scene images, which is extremely challenging due to large scale variations, pose/viewpoint changes, and occlusions.

... propose the Cascade Occluded Attention Transformer (COAT) for end-to-end person search.

... three-stage cascade ... focuses on detecting people in the first stage, while later stages simultaneously and progressively refine the representation for person detection and re-identification.

At each stage the occluded attention transformer applies tighter intersection over union thresholds, forcing the network to learn coarse-to-fine pose/scale invariant features.

... calculate each detection's occluded attention to differentiate a person's tokens from other people or the background.

... demonstrate the benefits of ... method by achieving state-of-the-art performance on two benchmark datasets.

Please like and share this post if you enjoyed it using the buttons at the bottom! Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact Web site with my other posts by category https://morrislee1234.wixsite.com/website LinkedIn https://www.linkedin.com/in/morris-lee-47877b7b #ComputerVision #Surveillance #AINewsClips #AI #ML #ArtificialIntelligence #MachineLearning

News to help your R&D in artificial intelligence, machine learning, robotics, computer vision, smart hardware

Re-identify people in images using multi-stage transformers with COAT

Recent Posts

Comments