gordicaleksa / pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing concepts that otherwise seem hard to grasp. IWSLT pretrained models are currently included.
☆1,024 Updated 4 years ago
Alternatives and similar repositories for pytorch-original-transformer:
Users who are interested in pytorch-original-transformer are comparing it to the libraries listed below.
- PyTorch implementation of some attentions for Deep Learning Researchers. ☆530 Updated 3 years ago
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg) ☆1,083 Updated last month
- An implementation of Performer, a linear attention-based transformer, in Pytorch ☆1,122 Updated 3 years ago
- Pytorch Lightning code guideline for conferences ☆1,261 Updated last year
- Reformer, the efficient Transformer, in Pytorch ☆2,163 Updated last year
- Long Range Arena for Benchmarking Efficient Transformers ☆751 Updated last year
- Transformer based on a variant of attention that has linear complexity with respect to sequence length ☆756 Updated 11 months ago
- Pytorch library for fast transformer implementations ☆1,697 Updated 2 years ago
- Paper implementations from scratch and machine learning tutorials ☆348 Updated last year
- ☆795 Updated 2 weeks ago
- Flexible components pairing 🤗 Transformers with Pytorch Lightning ☆609 Updated 2 years ago
- FastFormers - highly efficient transformer models for NLU ☆706 Updated last month
- Toolbox of models, callbacks, and datasets for AI/ML researchers. ☆1,718 Updated 2 weeks ago
- PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures. ☆261 Updated 4 years ago
- A smaller subset of 10 easily classified classes from Imagenet, and a little more French ☆1,026 Updated 2 years ago
- PyTorch extensions for high performance and large scale training. ☆3,306 Updated 2 weeks ago
- Generic template to bootstrap your PyTorch project. ☆641 Updated last year
- A library to inspect and extract intermediate layers of PyTorch models. ☆472 Updated 2 years ago
- All about the fundamental blocks of TF and JAX! ☆274 Updated 3 years ago
- View model summaries in PyTorch! ☆2,765 Updated this week
- Machine learning metrics for distributed, scalable PyTorch applications. ☆2,253 Updated this week
- Shape and dimension inference (Keras-like) for PyTorch layers and neural networks ☆571 Updated 2 years ago
- Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information process… ☆236 Updated 11 months ago
- My implementation of various GAN (generative adversarial networks) architectures like vanilla GAN (Goodfellow et al.), cGAN (Mirza et al.… ☆381 Updated 4 years ago
- Library for 8-bit optimizers and quantization routines. ☆716 Updated 2 years ago
- 🧠 A study guide to learn about Transformers ☆1,576 Updated last year
- Unifying Variational Autoencoder (VAE) implementations in Pytorch (NeurIPS 2022) ☆1,903 Updated 8 months ago
- Official DeiT repository ☆4,177 Updated last year
- An unofficial styleguide and best practices summary for PyTorch ☆1,976 Updated 3 years ago
- Quick, visual, principled introduction to pytorch code through five colab notebooks. ☆424 Updated 3 months ago