gordicaleksa / pytorch-original-transformer
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
☆982Updated 3 years ago
Related projects: ⓘ
- Pytorch Lightning code guideline for conferences☆1,228Updated 11 months ago
- Simple transformer implementation from scratch in pytorch.☆1,035Updated 4 months ago
- Toolbox of models, callbacks, and datasets for AI/ML researchers.☆1,678Updated this week
- Reformer, the efficient Transformer, in Pytorch☆2,097Updated last year
- Pytorch library for fast transformer implementations☆1,621Updated last year
- An unofficial styleguide and best practices summary for PyTorch☆1,900Updated 2 years ago
- PyTorch implementation of some attentions for Deep Learning Researchers.☆511Updated 2 years ago
- An implementation of Performer, a linear attention-based transformer, in Pytorch☆1,080Updated 2 years ago
- PyTorch tutorials and best practices.☆1,648Updated 2 years ago
- Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information process…☆230Updated 4 months ago
- Example deep learning projects that use wandb's features.☆1,113Updated last week
- torch-optimizer -- collection of optimizers for Pytorch☆3,012Updated 5 months ago
- A concise but complete full-attention transformer with a set of promising experimental features from various papers☆4,573Updated last week
- Long Range Arena for Benchmarking Efficient Transformers☆711Updated 9 months ago
- Transformer based on a variant of attention that is linear complexity in respect to sequence length☆670Updated 4 months ago
- The WeightWatcher tool for predicting the accuracy of Deep Neural Networks☆1,438Updated last week
- Longformer: The Long-Document Transformer☆2,028Updated last year
- Flexible components pairing 🤗 Transformers with Pytorch Lightning☆610Updated last year
- A python library for self-supervised learning on images.☆2,923Updated this week
- A Unified Library for Parameter-Efficient and Modular Transfer Learning☆2,525Updated 3 weeks ago
- A learning rate range test implementation in PyTorch☆912Updated 3 weeks ago
- Transformers for Longer Sequences☆564Updated 2 years ago
- A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.☆1,116Updated this week
- Over 200 figures and diagrams of the most popular deep learning architectures and layers FREE TO USE in your blog posts, slides, presenta…☆1,369Updated 3 years ago
- [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …☆1,759Updated 7 months ago
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.☆1,266Updated last year
- Unifying Variational Autoencoder (VAE) implementations in Pytorch (NeurIPS 2022)☆1,769Updated last month
- Model interpretability and understanding for PyTorch☆4,810Updated this week
- 🟠 A study guide to learn about Graph Neural Networks (GNNs)☆1,099Updated last year
- Build, train, deploy, scale and maintain deep learning models. Understand ML infrastructure and MLOps using hands-on examples.☆1,103Updated last year