Computer vision transformer models (CLIP, ViT, DeiT) released by Hugging Face
- morrislee
- May 13, 2021
- 1 min read
Computer vision transformer models (CLIP, ViT, DeiT) released by Hugging Face
Transformers meets VISION
v4.6.0 is the first CV dedicated release!
Hugging Face Twitter post https://twitter.com/huggingface/status/1392503426978881536

Google Colab
Quick demo: Vision Transformer (ViT) by Google Brain
Fine-tune the Vision Transformer on CIFAR-10
Includes computer vision transformers from
OpenAI
CLIP - image-text similarity or zero-shot image classification
Documentation https://huggingface.co/transformers/model_doc/clip.html
Google AI
ViT - Vision Transformer
Documentation https://huggingface.co/transformers/model_doc/vit.html
Facebook AI
DeiT - Data-efficient vision transformer (improvement over ViT)
State-of-the-art image classification
Documentation https://huggingface.co/transformers/model_doc/deit.html
Please like and share this post if you enjoyed it using the buttons at the bottom!
Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact
Web site with my other posts by category https://morrislee1234.wixsite.com/website
Comentarios