"ViT_from_scratch.py" is a simple implementation of ViT (Vision Transformer) from scratch. Its Transformer encoder is taken from the nn module rather than written by hand.
The dataset is available here. You can read more in the original paper.
"vit-mnist.ipynb" is a more detailed implementation of ViT. The Transformer Encoder is implemented from scratch.
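As a rough illustration of the first step a ViT performs, the sketch below splits an image into non-overlapping patches and flattens each one into a vector. It uses plain Python lists so it runs without any dependencies. The 28x28 size matches MNIST images, but the patch size of 7 is an assumption chosen for illustration, not a value taken from either file in this repository, and `split_into_patches` is a hypothetical helper name.

```python
def split_into_patches(image, patch_size):
    """Split a 2D image (list of rows) into flattened, non-overlapping patches."""
    height = len(image)
    width = len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image size must be divisible by patch_size")
    patches = []
    # Walk over the patch grid row by row, left to right.
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            # Flatten one patch in row-major order.
            patch = [
                image[row][col]
                for row in range(top, top + patch_size)
                for col in range(left, left + patch_size)
            ]
            patches.append(patch)
    return patches


# Dummy 28x28 image whose pixel value encodes its position (row * 28 + col).
image = [[row * 28 + col for col in range(28)] for row in range(28)]

patches = split_into_patches(image, patch_size=7)  # assumed patch size
num_patches = len(patches)    # (28 / 7) ** 2 patches
patch_dim = len(patches[0])   # 7 * 7 pixels per patch
seq_len = num_patches + 1     # one extra position for the class token
```

In a full model, each flattened patch would then be linearly projected to the embedding dimension, a class token prepended, and position embeddings added before the sequence enters the Transformer encoder.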
Youngsiii/ViT

