gordicaleksa / pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
☆1,028Updated 4 years ago
Alternatives and similar repositories for pytorch-original-transformer
Users that are interested in pytorch-original-transformer are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of some attentions for Deep Learning Researchers.☆532Updated 3 years ago
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)☆1,084Updated last month
- Pytorch library for fast transformer implementations☆1,703Updated 2 years ago
- Toolbox of models, callbacks, and datasets for AI/ML researchers.☆1,726Updated last week
- Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023☆2,836Updated last month
- High-quality Neural Networks for Computer Vision 😎☆445Updated 2 years ago
- Flexible components pairing 🤗 Transformers with Pytorch Lightning☆608Updated 2 years ago
- Pytorch Lightning code guideline for conferences☆1,264Updated last year
- Reformer, the efficient Transformer, in Pytorch☆2,168Updated last year
- Generic template to bootstrap your PyTorch project.☆642Updated last year
- Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information process…☆237Updated last year
- FrancescoSaverioZuppichini / Pytorch-how-and-when-to-use-Module-Sequential-ModuleList-and-ModuleDictCode for my medium article☆370Updated 4 years ago
- PyTorch tutorials and best practices.☆1,679Updated last month
- Transformer based on a variant of attention that is linear complexity in respect to sequence length☆764Updated last year
- Library for 8-bit optimizers and quantization routines.☆716Updated 2 years ago
- Collection of Pytorch lightning tutorial form as rich scripts automatically transformed to ipython notebooks.☆314Updated 2 weeks ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆1,142Updated last year
- Paper implementations from scratch and machine learning tutorials☆348Updated last year
- An unofficial styleguide and best practices summary for PyTorch☆1,979Updated 3 years ago
- PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures.☆262Updated 4 years ago
- Example deep learning projects that use wandb's features.☆1,161Updated 3 weeks ago
- An implementation of Performer, a linear attention-based transformer, in Pytorch☆1,129Updated 3 years ago
- STAT 453: Intro to Deep Learning @ UW-Madison (Spring 2021)☆501Updated 3 years ago
- Tensors, for human consumption☆1,251Updated 5 months ago
- Quick, visual, principled introduction to pytorch code through five colab notebooks.☆426Updated 4 months ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,109Updated last year
- A learning rate range test implementation in PyTorch☆958Updated 2 weeks ago
- Longformer: The Long-Document Transformer☆2,120Updated 2 years ago
- ☆800Updated last month
- VQVAEs, GumbelSoftmaxes and friends☆561Updated 3 years ago