gordicaleksa / pytorch-original-transformerLinks
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
☆1,038Updated 4 years ago
Alternatives and similar repositories for pytorch-original-transformer
Users that are interested in pytorch-original-transformer are comparing it to the libraries listed below
Sorting:
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)☆1,088Updated 5 months ago
- PyTorch tutorials and best practices.☆1,695Updated 5 months ago
- All about the fundamental blocks of TF and JAX!☆276Updated 3 years ago
- Pytorch Lightning code guideline for conferences☆1,279Updated last year
- PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures.☆265Updated 5 years ago
- ☆814Updated 4 months ago
- Reformer, the efficient Transformer, in Pytorch☆2,177Updated 2 years ago
- Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information process…☆238Updated last year
- PyTorch implementation of some attentions for Deep Learning Researchers.☆536Updated 3 years ago
- An implementation of Performer, a linear attention-based transformer, in Pytorch☆1,147Updated 3 years ago
- Attention Is All You Need | a PyTorch Tutorial to Transformers☆337Updated last year
- Complete deep learning project developed in Full Stack Deep Learning, Spring 2021☆446Updated 3 years ago
- FrancescoSaverioZuppichini / Pytorch-how-and-when-to-use-Module-Sequential-ModuleList-and-ModuleDictCode for my medium article☆372Updated 4 years ago
- Pytorch library for fast transformer implementations☆1,727Updated 2 years ago
- Example deep learning projects that use wandb's features.☆1,179Updated last week
- Build, train, deploy, scale and maintain deep learning models. Understand ML infrastructure and MLOps using hands-on examples.☆1,176Updated 2 years ago
- This is a collection of the code that accompanies the reports in The Gallery by Weights & Biases.☆341Updated 3 years ago
- 100 exercises to learn JAX☆590Updated 3 years ago
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world …☆654Updated last year
- Collection of Pytorch lightning tutorial form as rich scripts automatically transformed to ipython notebooks.☆319Updated 3 weeks ago
- Course notes☆710Updated last year
- Toolbox of models, callbacks, and datasets for AI/ML researchers.☆1,736Updated 3 weeks ago
- Host repository for the "Reproducible Deep Learning" PhD course☆406Updated 3 years ago
- A walkthrough of transformer architecture code☆353Updated last year
- Generic template to bootstrap your PyTorch project.☆646Updated last year
- High-quality Neural Networks for Computer Vision 😎☆447Updated 2 years ago
- NYU Deep Learning Spring 2021☆1,630Updated 11 months ago
- Transformer based on a variant of attention that is linear complexity in respect to sequence length☆795Updated last year
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,051Updated last year
- Long Range Arena for Benchmarking Efficient Transformers☆762Updated last year