Computer vision transformer models (CLIP, ViT, DeiT) released by Hugging Face
Computer vision transformer models (CLIP, ViT, DeiT) released by Hugging Face
Transformers meets VISION
v4.6.0 is the first CV dedicated release!
Hugging Face Twitter post https://twitter.com/huggingface/status/1392503426978881536
Google Colab
Quick demo: Vision Transformer (ViT) by Google Brain
Fine-tune the Vision Transformer on CIFAR-10
Includes computer vision transformers from
OpenAI
CLIP - image-text similarity or zero-shot image classification
Documentation https://huggingface.co/transformers/model_doc/clip.html
Google AI
ViT - Vision Transformer
Documentation https://huggingface.co/transformers/model_doc/vit.html
Facebook AI
DeiT - Data-efficient vision transformer (improvement over ViT)
State-of-the-art image classification
Documentation https://huggingface.co/transformers/model_doc/deit.html
Please like and share this post if you enjoyed it using the buttons at the bottom!
Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact
Web site with my other posts by category https://morrislee1234.wixsite.com/website
Comments