Survey of image captioning using deep learning
Deep Learning Approaches on Image Captioning: A Review
arXiv paper abstract https://arxiv.org/abs/2201.12944
arXiv PDF paper https://arxiv.org/pdf/2201.12944.pdf
Image captioning is a research area of immense importance, aiming to generate natural language descriptions for visual content in the form of still images.
... deep learning and more recently vision-language pre-training techniques has revolutionized the field, leading to more sophisticated methods and improved performance.
In this survey paper, ... provide a structured review of deep learning methods in image captioning by presenting a comprehensive taxonomy and discussing each method category in detail.
... examine the datasets commonly employed in image captioning research, as well as the evaluation metrics used to assess the performance of different captioning models.
... address the challenges faced in this field by emphasizing issues such as object hallucination, missing context, illumination conditions, contextual understanding, and referring expressions.
... rank different deep learning methods' performance according to widely used evaluation metrics, giving insight into the current state of the art ...
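To make the idea of ranking captioning models by an evaluation metric concrete, here is a minimal Python sketch. It is not taken from the paper. It assumes a simple clipped n-gram precision (the building block of BLEU-style scores) as the metric, and the names `clipped_precision` and `rank_models`, along with any model names and captions passed to them, are hypothetical and for illustration only.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def clipped_precision(candidate, references, n=1):
    """Clipped n-gram precision of one candidate caption.

    Assumption: whitespace tokenization and lowercasing stand in for
    the tokenizers used by real captioning benchmarks.
    """
    cand_counts = Counter(ngrams(candidate.lower().split(), n))
    if not cand_counts:
        return 0.0
    # For each n-gram, keep the highest count seen in any single reference.
    max_ref = Counter()
    for ref in references:
        ref_counts = Counter(ngrams(ref.lower().split(), n))
        for gram, count in ref_counts.items():
            max_ref[gram] = max(max_ref[gram], count)
    # Clip candidate counts so repeated words cannot inflate the score.
    matched = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return matched / sum(cand_counts.values())


def rank_models(outputs, references, n=1):
    """Rank hypothetical models (name -> caption) by score, best first."""
    scores = {name: clipped_precision(cap, references, n) for name, cap in outputs.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

Calling `rank_models` with a dictionary of model names and their captions for one image, plus a list of reference captions, returns the models sorted from highest to lowest score. Real benchmarks combine several n-gram orders and add further corrections, so treat this only as an illustration of the ranking step.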
Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact
Web site with my other posts by category https://morrislee1234.wixsite.com/website