gordicaleksa / pytorch-original-transformerLinks
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
☆1,032Updated 4 years ago
Alternatives and similar repositories for pytorch-original-transformer
Users that are interested in pytorch-original-transformer are comparing it to the libraries listed below
Sorting:
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)☆1,086Updated 2 months ago
- Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023☆2,861Updated 2 months ago
- Pytorch Lightning code guideline for conferences☆1,266Updated last year
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,035Updated 9 months ago
- A library to inspect and extract intermediate layers of PyTorch models.☆473Updated 3 years ago
- ☆803Updated last month
- Reformer, the efficient Transformer, in Pytorch☆2,169Updated last year
- An implementation of Performer, a linear attention-based transformer, in Pytorch☆1,133Updated 3 years ago
- 🧠 A study guide to learn about Transformers☆1,593Updated 2 years ago
- Flexible components pairing 🤗 Transformers with Pytorch Lightning☆609Updated 2 years ago
- Generic template to bootstrap your PyTorch project.☆643Updated last year
- Toolbox of models, callbacks, and datasets for AI/ML researchers.☆1,730Updated this week
- MADGRAD Optimization Method☆801Updated 4 months ago
- Pytorch library for fast transformer implementations☆1,709Updated 2 years ago
- Longformer: The Long-Document Transformer☆2,130Updated 2 years ago
- 🔥 A collection of PyTorch notebooks for learning and practicing deep learning☆564Updated 2 years ago
- VQVAEs, GumbelSoftmaxes and friends☆566Updated 3 years ago
- Library for 8-bit optimizers and quantization routines.☆716Updated 2 years ago
- Transformer seq2seq model, program that can build a language translator from parallel corpus☆1,397Updated 2 years ago
- Transformer based on a variant of attention that is linear complexity in respect to sequence length☆768Updated last year
- Github repo with tutorials to fine tune transformers for diff NLP tasks☆855Updated last year
- 100 exercises to learn JAX☆579Updated 2 years ago
- Tez is a super-simple and lightweight Trainer for PyTorch. It also comes with many utils that you can use to tackle over 90% of deep lear…☆1,164Updated 2 years ago
- Complete deep learning project developed in Full Stack Deep Learning, Spring 2021☆448Updated 3 years ago
- FastFormers - highly efficient transformer models for NLU☆705Updated 2 months ago
- Unifying Variational Autoencoder (VAE) implementations in Pytorch (NeurIPS 2022)☆1,915Updated 10 months ago
- Long Range Arena for Benchmarking Efficient Transformers☆757Updated last year
- The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series o…☆732Updated last year
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆1,145Updated last year
- A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.☆1,201Updated this week