pbloem / formerLinks
Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)
☆1,088Updated 5 months ago
Alternatives and similar repositories for former
Users that are interested in former are comparing it to the libraries listed below
Sorting:
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆1,043Updated 4 years ago
- A collection of resources to study Transformers in depth.☆546Updated last year
- Pytorch library for fast transformer implementations☆1,730Updated 2 years ago
- PyTorch implementation of some attentions for Deep Learning Researchers.☆541Updated 3 years ago
- Reformer, the efficient Transformer, in Pytorch☆2,182Updated 2 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,115Updated 3 years ago
- An implementation of Performer, a linear attention-based transformer, in Pytorch☆1,148Updated 3 years ago
- Longformer: The Long-Document Transformer☆2,162Updated 2 years ago
- ☆813Updated 5 months ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,588Updated 5 years ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,348Updated last year
- Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information process…☆240Updated last year
- Notebooks and various random fun☆1,103Updated 2 years ago
- PyTorch tutorials and best practices.☆1,696Updated 5 months ago
- Implementation of Transformer Model in Tensorflow☆473Updated 2 years ago
- Transformer seq2seq model, program that can build a language translator from parallel corpus☆1,406Updated 2 years ago
- Hopfield Networks is All You Need☆1,846Updated 2 years ago
- Course notes☆715Updated last year
- The goal of this library is to generate more helpful exception messages for matrix algebra expressions for numpy, pytorch, jax, tensorflo…☆811Updated 3 years ago
- Attention Is All You Need | a PyTorch Tutorial to Transformers☆338Updated last year
- FrancescoSaverioZuppichini / Pytorch-how-and-when-to-use-Module-Sequential-ModuleList-and-ModuleDictCode for my medium article☆372Updated 5 years ago
- PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures.☆265Updated 5 years ago
- Long Range Arena for Benchmarking Efficient Transformers☆764Updated last year
- VQVAEs, GumbelSoftmaxes and friends☆585Updated 3 years ago
- A repository containing tutorials for practical NLP using PyTorch☆538Updated 6 years ago
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆259Updated last year
- The Hitchiker's Guide to PyTorch☆1,198Updated 4 years ago
- Single Headed Attention RNN - "Stop thinking with your head"☆1,183Updated 3 years ago
- Pytorch Lightning code guideline for conferences☆1,279Updated last year
- ☆3,669Updated 2 years ago