pbloem / formerLinks
Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)
☆1,087Updated 5 months ago
Alternatives and similar repositories for former
Users that are interested in former are comparing it to the libraries listed below
Sorting:
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆1,038Updated 4 years ago
- A collection of resources to study Transformers in depth.☆543Updated last year
- Pytorch library for fast transformer implementations☆1,727Updated 2 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,113Updated 3 years ago
- Pytorch Lightning code guideline for conferences☆1,279Updated last year
- PyTorch tutorials and best practices.☆1,693Updated 4 months ago
- ☆812Updated 4 months ago
- Longformer: The Long-Document Transformer☆2,159Updated 2 years ago
- Hopfield Networks is All You Need☆1,838Updated 2 years ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,345Updated last year
- Reformer, the efficient Transformer, in Pytorch☆2,180Updated 2 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,584Updated 5 years ago
- Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.☆5,597Updated last year
- Long Range Arena for Benchmarking Efficient Transformers☆762Updated last year
- PyTorch implementation of some attentions for Deep Learning Researchers.☆534Updated 3 years ago
- FrancescoSaverioZuppichini / Pytorch-how-and-when-to-use-Module-Sequential-ModuleList-and-ModuleDictCode for my medium article☆372Updated 4 years ago
- Implementation of Transformer Model in Tensorflow☆472Updated 2 years ago
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆256Updated last year
- Papers & presentation materials from Hugging Face's internal science day☆2,048Updated 4 years ago
- Github repo with tutorials to fine tune transformers for diff NLP tasks☆856Updated last year
- Course notes☆708Updated last year
- Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information process…☆239Updated last year
- PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures.☆264Updated 5 years ago
- The Hitchiker's Guide to PyTorch☆1,197Updated 3 years ago
- A Visual Analysis Tool to Explore Learned Representations in Transformers Models☆599Updated last year
- The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020☆600Updated 5 years ago
- 🧠 A study guide to learn about Transformers☆1,605Updated 2 years ago
- An implementation of Performer, a linear attention-based transformer, in Pytorch☆1,144Updated 3 years ago
- Transformers for Longer Sequences☆618Updated 2 years ago
- A walkthrough of transformer architecture code☆352Updated last year