This repository contains a blog article titled "CNNs look, ViTs pay Attention"
blog.md: A detailed blog post on Vision Transformers.images/: Folder containing images referenced in the blog.
The blog discusses Vision Transformers (ViTs) and how they are used in computer vision tasks. It compares ViTs with traditional CNNs, explores their advantages, and covers real-world applications.
The images/ folder contains the following images:
- vit.png: Vision Transformer architecture.
- transformer.png: Working of transformer in NLP task.
- attention.png: Self-attention mechanism in ViTs.
To view the blog locally:
- Clone the repository:
git clone https://github.com/manyagupta13/ViT-Blog.git