vulus98 / Rethinking-attention

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
44Updated 2 months ago

Alternatives and similar repositories for Rethinking-attention:

Users that are interested in Rethinking-attention are comparing it to the libraries listed below