gordicaleksa / pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
☆1,012Updated 4 years ago
Alternatives and similar repositories for pytorch-original-transformer:
Users that are interested in pytorch-original-transformer are comparing it to the libraries listed below
- ☆779Updated 10 months ago
- Reformer, the efficient Transformer, in Pytorch☆2,144Updated last year
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks☆474Updated 2 years ago
- Pytorch library for fast transformer implementations☆1,670Updated last year
- Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information process…☆231Updated 9 months ago
- Flexible components pairing 🤗 Transformers with Pytorch Lightning☆612Updated 2 years ago
- The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series o…☆680Updated last year
- Simple transformer implementation from scratch in pytorch.☆1,068Updated 8 months ago
- ☆760Updated last week
- PyTorch implementation of some attentions for Deep Learning Researchers.☆520Updated 2 years ago
- PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures.☆256Updated 4 years ago
- Long Range Arena for Benchmarking Efficient Transformers☆740Updated last year
- Pytorch Lightning code guideline for conferences☆1,245Updated last year
- Toolbox of models, callbacks, and datasets for AI/ML researchers.☆1,709Updated 3 weeks ago
- Fast Block Sparse Matrices for Pytorch☆546Updated 4 years ago
- Complete deep learning project developed in Full Stack Deep Learning, Spring 2021☆448Updated 3 years ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,070Updated 10 months ago
- A library to inspect and extract intermediate layers of PyTorch models.☆470Updated 2 years ago
- All about the fundamental blocks of TF and JAX!☆275Updated 3 years ago
- Longformer: The Long-Document Transformer☆2,073Updated last year
- An unofficial styleguide and best practices summary for PyTorch☆1,950Updated 3 years ago
- Tez is a super-simple and lightweight Trainer for PyTorch. It also comes with many utils that you can use to tackle over 90% of deep lear…☆1,163Updated 2 years ago
- This is a collection of the code that accompanies the reports in The Gallery by Weights & Biases.☆332Updated 2 years ago
- 100 exercises to learn JAX☆575Updated 2 years ago
- Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory☆431Updated 5 months ago
- Generic template to bootstrap your PyTorch project.☆639Updated last year
- The goal of this library is to generate more helpful exception messages for matrix algebra expressions for numpy, pytorch, jax, tensorflo…☆801Updated 2 years ago
- PyTorch tutorials and best practices.☆1,673Updated 2 years ago
- PyTorch extensions for high performance and large scale training.☆3,235Updated 2 weeks ago
- Paper implementations from scratch and machine learning tutorials☆343Updated last year