greentfrapp / attention-primer
A demonstration of the attention mechanism with some toy experiments and explanations.
☆107 Updated 6 years ago
Alternatives and similar repositories for attention-primer:
Users who are interested in attention-primer are comparing it to the libraries listed below.
- Configure Python functions explicitly and safely ☆126 Updated 4 months ago
- ☆153 Updated 4 years ago
- Training Transformer-XL on 128 GPUs ☆140 Updated 4 years ago
- ☆64 Updated 4 years ago
- ☆102 Updated 4 years ago
- Learning to search in PyTorch ☆110 Updated 5 years ago
- Python implementation of GLN in different frameworks ☆98 Updated 4 years ago
- Pip-installable differentiable stacks in PyTorch! ☆65 Updated 4 years ago
- Experiment orchestration ☆103 Updated 4 years ago
- Weekly humanitarian AI reading group at Mila. ☆21 Updated 6 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX ☆59 Updated 4 years ago
- PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting. ☆37 Updated 6 years ago
- ☆21 Updated 6 years ago
- Create interactive textual heat maps for Jupyter notebooks ☆196 Updated 9 months ago
- Implementation of papers on Deep Seq2seq learning using PyTorch. ☆219 Updated 6 years ago
- PyTorch functions and utilities to make your life easier ☆195 Updated 4 years ago
- LM Pretraining with PyTorch/TPU ☆134 Updated 5 years ago
- ☆149 Updated 2 years ago
- A generative modelling toolkit for PyTorch. ☆70 Updated 3 years ago
- Visualising the Transformer encoder ☆111 Updated 4 years ago
- A library for evaluating representations. ☆76 Updated 3 years ago
- Different types of autoencoders illustrated on MNIST using TensorFlow. ☆36 Updated 6 years ago
- A set of simple examples ported from PyTorch for TensorFlow Eager Execution ☆73 Updated 6 years ago
- Generic reinforcement learning codebase in TensorFlow ☆96 Updated 3 years ago
- Timetrack - a simple command line program for analysing your own calendar data. ☆69 Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ☆146 Updated 3 years ago
- Presentations ☆44 Updated 6 years ago
- Boilerplate code for Torch-based ML projects ☆10 Updated 3 years ago
- Code for scaling Transformers ☆26 Updated 4 years ago
- The Annotated Encoder Decoder with Attention ☆166 Updated 4 years ago