greentfrapp / attention-primer
A demonstration of the attention mechanism with some toy experiments and explanations.
☆108Updated 6 years ago
Alternatives and similar repositories for attention-primer:
Users that are interested in attention-primer are comparing it to the libraries listed below
- Training Transformer-XL on 128 GPUs☆140Updated 4 years ago
- Configure Python functions explicitly and safely☆126Updated 5 months ago
- ☆153Updated 4 years ago
- ☆28Updated 6 years ago
- learning to search in pytorch☆110Updated 5 years ago
- PyTorch functions and utilities to make your life easier☆195Updated 4 years ago
- Tensor Shape Annotation Library (numpy, tensorflow, pytorch, ...)☆266Updated 4 years ago
- The Annotated Encoder Decoder with Attention☆166Updated 4 years ago
- Synthetic book pages created with a PGGAN☆73Updated 6 years ago
- ☆151Updated 2 years ago
- Probabilistic classification in PyTorch/TensorFlow/scikit-learn with Fenchel-Young losses☆185Updated last year
- Pip-installable differentiable stacks in PyTorch!☆65Updated 4 years ago
- Pytorch Cheatsheet☆90Updated 6 years ago
- ☆103Updated 4 years ago
- ☆64Updated 5 years ago
- Create interactive textual heat maps for Jupiter notebooks☆196Updated 11 months ago
- AI/ML citation graph with postgres + graphql☆187Updated 4 years ago
- Loss Patterns of Neural Networks☆84Updated 3 years ago
- Visualising the Transformer encoder☆111Updated 4 years ago
- a lightweight and simple logger for Machine Learning☆127Updated 4 years ago
- This code was developed for the Intro to GANs workshop for Machine Learning Tokyo (MLT).☆66Updated 6 years ago
- Code for scaling Transformers☆26Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Updated 3 years ago
- Weekly humanitarian AI reading group at Mila.☆21Updated 6 years ago
- Python implementation of GLN in different frameworks☆98Updated 4 years ago
- Explorations in building seq2seq models using PyTorch and fast.ai☆14Updated 5 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆59Updated 5 years ago
- MT Tutorial for the JSALT 2019 Summer School☆48Updated 5 years ago
- Unit Testing for pytorch, based on mltest☆310Updated 4 years ago
- Code for: Implicit Competitive Regularization in GANs☆114Updated 3 years ago