gordicaleksa / pytorch-original-transformerLinks
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
☆1,076Updated 4 years ago
Alternatives and similar repositories for pytorch-original-transformer
Users that are interested in pytorch-original-transformer are comparing it to the libraries listed below
Sorting:
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)☆1,093Updated 9 months ago
- ☆826Updated 8 months ago
- Pytorch Lightning code guideline for conferences☆1,280Updated 2 years ago
- PyTorch implementation of some attentions for Deep Learning Researchers.☆548Updated 3 years ago
- Complete deep learning project developed in Full Stack Deep Learning, Spring 2021☆447Updated 4 years ago
- PyTorch tutorials and best practices.☆1,703Updated 9 months ago
- Generic template to bootstrap your PyTorch project.☆650Updated 2 years ago
- Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information process…☆241Updated last year
- Attention Is All You Need | a PyTorch Tutorial to Transformers☆359Updated last year
- Over 200 figures and diagrams of the most popular deep learning architectures and layers FREE TO USE in your blog posts, slides, presenta…☆1,510Updated 4 years ago
- Paper implementations from scratch and machine learning tutorials☆347Updated last year
- Example deep learning projects that use wandb's features.☆1,197Updated 2 weeks ago
- This is a collection of the code that accompanies the reports in The Gallery by Weights & Biases.☆343Updated 3 years ago
- Open source guides/codes for mastering deep learning to deploying deep learning in production in PyTorch, Python, Apptainer, and more.☆863Updated 3 months ago
- Tez is a super-simple and lightweight Trainer for PyTorch. It also comes with many utils that you can use to tackle over 90% of deep lear…☆1,159Updated 2 years ago
- The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series o…☆774Updated 2 years ago
- Course notes☆737Updated last year
- Toolbox of models, callbacks, and datasets for AI/ML researchers.☆1,755Updated 3 weeks ago
- 🧠 A study guide to learn about Transformers☆1,623Updated 2 years ago
- All about the fundamental blocks of TF and JAX!☆275Updated 4 years ago
- 🟠 A study guide to learn about Graph Neural Networks (GNNs)☆1,265Updated 2 years ago
- FrancescoSaverioZuppichini / Pytorch-how-and-when-to-use-Module-Sequential-ModuleList-and-ModuleDictCode for my medium article☆374Updated 5 years ago
- High-quality Neural Networks for Computer Vision 😎☆447Updated 3 years ago
- Host repository for the "Reproducible Deep Learning" PhD course☆407Updated 3 years ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,355Updated last year
- PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures.☆265Updated 5 years ago
- A collection of resources to study Transformers in depth.☆559Updated 2 years ago
- Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023☆3,046Updated last month
- 100 exercises to learn JAX☆594Updated 3 years ago
- Reformer, the efficient Transformer, in Pytorch☆2,191Updated 2 years ago