tahmid0007 / VisionTransformer

A complete easy to follow implementation of Google's Vision Transformer proposed in "AN IMAGE IS WORTH 16X16 WORDS". This pytorch implementation has comments for better understanding.
94Updated 4 years ago

Alternatives and similar repositories for VisionTransformer:

Users that are interested in VisionTransformer are comparing it to the libraries listed below