gordicaleksa / pytorch-original-transformerLinks

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.

☆1,033

Alternatives and similar repositories for pytorch-original-transformer

Users that are interested in pytorch-original-transformer are comparing it to the libraries listed below

Sorting:

pbloem / former
Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)
☆1,087Updated 3 months ago
Paperspace / PyTorch-101-Tutorial-Series
PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures.
☆263Updated 4 years ago
the-full-stack / fsdl-text-recognizer-2021-labs
Complete deep learning project developed in Full Stack Deep Learning, Spring 2021
☆448Updated 3 years ago
vahidk / EffectivePyTorch
PyTorch tutorials and best practices.
☆1,690Updated 3 months ago
brandokoch / attention-is-all-you-need-paper
Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information process…
☆238Updated last year
sooftware / attentions
PyTorch implementation of some attentions for Deep Learning Researchers.
☆534Updated 3 years ago
Lightning-AI / deep-learning-project-template
Pytorch Lightning code guideline for conferences
☆1,273Updated last year
abhishekkrthakur / tez
Tez is a super-simple and lightweight Trainer for PyTorch. It also comes with many utils that you can use to tackle over 90% of deep lear…
☆1,164Updated 2 years ago
rll / deepul
☆811Updated 3 months ago
lucidrains / reformer-pytorch
Reformer, the efficient Transformer, in Pytorch
☆2,174Updated 2 years ago
AakashKumarNain / TF_JAX_tutorials
All about the fundamental blocks of TF and JAX!
☆274Updated 3 years ago
The-AI-Summer / Deep-Learning-In-Production
Build, train, deploy, scale and maintain deep learning models. Understand ML infrastructure and MLOps using hands-on examples.
☆1,169Updated 2 years ago
ritchieng / deep-learning-wizard
Open source guides/codes for mastering deep learning to deploying deep learning in production in PyTorch, Python, Apptainer, and more.
☆848Updated 2 weeks ago
Lightning-Universe / lightning-bolts
Toolbox of models, callbacks, and datasets for AI/ML researchers.
☆1,732Updated last week
Lightning-AI / tutorials
Collection of Pytorch lightning tutorial form as rich scripts automatically transformed to ipython notebooks.
☆317Updated last week
sscardapane / reprodl2021
Host repository for the "Reproducible Deep Learning" PhD course
☆407Updated 3 years ago
wandb / examples
Example deep learning projects that use wandb's features.
☆1,170Updated last week
FrancescoSaverioZuppichini / Pytorch-how-and-when-to-use-Module-Sequential-ModuleList-and-ModuleDict
Code for my medium article
☆371Updated 4 years ago
jankrepl / mildlyoverfitted
Paper implementations from scratch and machine learning tutorials
☆347Updated last year
FrancescoSaverioZuppichini / glasses
High-quality Neural Networks for Computer Vision 😎
☆446Updated 2 years ago
idiap / fast-transformers
Pytorch library for fast transformer implementations
☆1,724Updated 2 years ago
wandb / awesome-dl-projects
This is a collection of the code that accompanies the reports in The Gallery by Weights & Biases.
☆340Updated 3 years ago
shreyansh26 / Annotated-ML-Papers
Annotations of the interesting ML papers I read
☆243Updated this week
dair-ai / pytorch_notebooks
🔥 A collection of PyTorch notebooks for learning and practicing deep learning
☆573Updated 2 years ago
lucidrains / performer-pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
☆1,136Updated 3 years ago
parrt / tensor-sensor
The goal of this library is to generate more helpful exception messages for matrix algebra expressions for numpy, pytorch, jax, tensorflo…
☆810Updated 3 years ago
vlgiitr / papers_we_read
Summaries for exciting works in the field of Deep Learning.
☆354Updated last month
google-research / long-range-arena
Long Range Arena for Benchmarking Efficient Transformers
☆759Updated last year
Atcold / NYU-DLSP21
NYU Deep Learning Spring 2021
☆1,625Updated 10 months ago
davidbau / how-to-read-pytorch
Quick, visual, principled introduction to pytorch code through five colab notebooks.
☆430Updated 6 months ago