greentfrapp / attention-primer
A demonstration of the attention mechanism with some toy experiments and explanations.
☆108 Updated 6 years ago
Alternatives and similar repositories for attention-primer
Users who are interested in attention-primer are comparing it to the libraries listed below.
- Configure Python functions explicitly and safely ☆126 Updated 6 months ago
- ☆153 Updated 5 years ago
- ☆64 Updated 5 years ago
- ☆28 Updated 6 years ago
- Code for scaling Transformers ☆26 Updated 4 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX ☆59 Updated 5 years ago
- Training Transformer-XL on 128 GPUs ☆140 Updated 4 years ago
- A library for evaluating representations. ☆76 Updated 3 years ago
- Python implementation of GLN in different frameworks ☆98 Updated 4 years ago
- Pip-installable differentiable stacks in PyTorch! ☆65 Updated 4 years ago
- PyTorch functions and utilities to make your life easier ☆195 Updated 4 years ago
- ☆78 Updated 5 years ago
- A lightweight and simple logger for machine learning ☆127 Updated 4 years ago
- Learning to search in PyTorch ☆110 Updated 5 years ago
- http://nlp.seas.harvard.edu/2018/04/03/attention.html ☆62 Updated 4 years ago
- A generative modelling toolkit for PyTorch. ☆70 Updated 3 years ago
- Framework-agnostic library for checking array/tensor shapes at runtime. ☆46 Updated 4 years ago
- 👩 PyTorch and JAX code for the Madam optimiser. ☆51 Updated 4 years ago
- A collection of code snippets for my PyTorch Lightning projects ☆107 Updated 4 years ago
- My implementation of DeepMind's Perceiver ☆63 Updated 4 years ago
- Implementation of Feedback Transformer in PyTorch ☆107 Updated 4 years ago
- Weekly humanitarian AI reading group at Mila. ☆21 Updated 6 years ago
- 🧀 PyTorch code for the Fromage optimiser. ☆124 Updated 10 months ago
- Code for: Implicit Competitive Regularization in GANs ☆114 Updated 3 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ☆147 Updated 3 years ago
- ☆75 Updated 5 years ago
- A queue service for quickly developing scripts that use all your GPUs efficiently ☆85 Updated 2 years ago
- Official TensorFlow implementation of the paper "Y-Autoencoders: disentangling latent representations via sequential-encoding", Pattern R… ☆52 Updated 4 years ago
- A discrete sequential VAE ☆39 Updated 5 years ago
- Synthetic book pages created with a PGGAN ☆73 Updated 6 years ago