gordicaleksa / pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
☆1,017Updated 4 years ago
Alternatives and similar repositories for pytorch-original-transformer:
Users that are interested in pytorch-original-transformer are comparing it to the libraries listed below
- Simple transformer implementation from scratch in pytorch.☆1,080Updated 10 months ago
- Reformer, the efficient Transformer, in Pytorch☆2,151Updated last year
- Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information process…☆232Updated 10 months ago
- Pytorch library for fast transformer implementations☆1,684Updated last year
- An implementation of Performer, a linear attention-based transformer, in Pytorch☆1,115Updated 3 years ago
- Pytorch Lightning code guideline for conferences☆1,257Updated last year
- All about the fundamental blocks of TF and JAX!☆275Updated 3 years ago
- The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series o…☆699Updated last year
- PyTorch implementation of some attentions for Deep Learning Researchers.☆526Updated 3 years ago
- Attention Is All You Need | a PyTorch Tutorial to Transformers☆297Updated last year
- A library to inspect and extract intermediate layers of PyTorch models.☆472Updated 2 years ago
- PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures.☆259Updated 4 years ago
- VQVAEs, GumbelSoftmaxes and friends☆554Updated 3 years ago
- Transformer based on a variant of attention that is linear complexity in respect to sequence length☆746Updated 10 months ago
- Flexible components pairing 🤗 Transformers with Pytorch Lightning☆611Updated 2 years ago
- 100 exercises to learn JAX☆576Updated 2 years ago
- Tez is a super-simple and lightweight Trainer for PyTorch. It also comes with many utils that you can use to tackle over 90% of deep lear…☆1,163Updated 2 years ago
- Collection of Pytorch lightning tutorial form as rich scripts automatically transformed to ipython notebooks.☆306Updated last week
- Longformer: The Long-Document Transformer☆2,086Updated 2 years ago
- TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.☆1,552Updated this week
- Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023☆2,744Updated 4 months ago
- FastFormers - highly efficient transformer models for NLU☆704Updated last year
- A walkthrough of transformer architecture code☆339Updated last year
- Complete deep learning project developed in Full Stack Deep Learning, Spring 2021☆448Updated 3 years ago
- ☆782Updated last year
- Quick, visual, principled introduction to pytorch code through five colab notebooks.☆415Updated 2 months ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆1,127Updated last year
- Transformers for Longer Sequences☆594Updated 2 years ago
- VICReg official code base☆531Updated last year
- STAT 453: Intro to Deep Learning @ UW-Madison (Spring 2021)☆487Updated 3 years ago