top of page

News to help your R&D in artificial intelligence, machine learning, robotics, computer vision, smart hardware

As an Amazon Associate I earn

from qualifying purchases

Writer's picturemorrislee

Match an image taken at ground-level to an aerial photo with TransGCNN

Match an image taken at ground-level to an aerial photo with TransGCNN


Transformer-Guided Convolutional Neural Network for Cross-View Geolocalization



Ground-to-aerial geolocalization refers to localizing a ground-level query image by matching it to a reference database of geo-tagged aerial imagery.


This is very challenging due to the huge perspective differences in visual appearances and geometric configurations between these two views.


... propose ... Transformer-guided convolutional neural network (TransGCNN) ... couples CNN-based local features with Transformer-based global representations for enhanced representation learning.


... TransGCNN consists of a CNN backbone extracting feature map from an input image and a Transformer head modeling global context from the CNN map.


... Transformer head acts as a spatial-aware importance generator to select salient CNN features as the final feature representation.


... model achieves top-1 accuracy ... which outperforms the second-performing baseline with less than 50% parameters and almost 2x higher frame rate



Please like and share this post if you enjoyed it using the buttons at the bottom!


Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact

Web site with my other posts by category https://morrislee1234.wixsite.com/website



175 views0 comments

Comments


ClickBank paid link

bottom of page