greentfrapp / attention-primerLinks

A demonstration of the attention mechanism with some toy experiments and explanations.

☆107

Alternatives and similar repositories for attention-primer

Users that are interested in attention-primer are comparing it to the libraries listed below

Sorting:

PetrochukM / HParams
Configure Python functions explicitly and safely
☆128Updated last year
cybertronai / transformer-xl
Training Transformer-XL on 128 GPUs
☆141Updated 5 years ago
hal3 / macarico
learning to search in pytorch
☆110Updated 5 years ago
criteo-research / iclr_analysis
☆28Updated 6 years ago
srush / parallax
☆153Updated 5 years ago
viking-sudo-rm / stacknn-core
Pip-installable differentiable stacks in PyTorch!
☆65Updated 5 years ago
mblondel / fenchel-young-losses
Probabilistic classification in PyTorch/TensorFlow/scikit-learn with Fenchel-Young losses
☆192Updated 2 years ago
srush / awesome-ml-tracking
☆104Updated 4 years ago
bminixhofer / permon
A tool to monitor everything you want. Clean, simple, extensible and in one place.
☆82Updated 4 years ago
TimDettmers / transformer-xl
☆65Updated 5 years ago
AI-ON / Few-Shot-Music-Generation
☆158Updated 7 years ago
aiwabdn / pygln
Python implementation of GLN in different frameworks
☆97Updated 5 years ago
facebookresearch / dagger
Experiment orchestration
☆102Updated 5 years ago
AndreasMadsen / stable-nalu
Code for Neural Arithmetic Units (ICLR) and Measuring Arithmetic Extrapolation Performance (SEDL|NeurIPS)
☆146Updated 4 years ago
hardmaru / mdn_jax_tutorial
Mixture Density Networks (Bishop, 1994) tutorial in JAX
☆61Updated 5 years ago
bastings / annotated_encoder_decoder
The Annotated Encoder Decoder with Attention
☆167Updated 4 years ago
kyunghyuncho / backprop-kalman-filter
☆45Updated 6 years ago
anandsaha / nips.cocob.pytorch
PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting.
☆38Updated 7 years ago
Vermeille / Torchelie
Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.
☆110Updated this week
AnnaBeers / DeepZine
Synthetic book pages created with a PGGAN
☆73Updated 7 years ago
EdwardRaff / Quantifying-Independently-Reproducible-ML
☆75Updated 6 years ago
Tgaaly / pytorch-cheatsheet
Pytorch Cheatsheet
☆91Updated 7 years ago
lilianweng / generalization-experiment
TBA
☆77Updated 6 years ago
r0mainK / outperformer
Code for scaling Transformers
☆26Updated 5 years ago
arogozhnikov / readable_capsnet
Blazingly fast capsule networks in 75 lines of pytorch+einops
☆26Updated 4 years ago
jxbz / madam
👩 Pytorch and Jax code for the Madam optimiser.
☆53Updated 4 years ago
szymonmaszke / torchfunc
PyTorch functions and utilities to make your life easier
☆194Updated 4 years ago
IBM / pytorchpipe
PyTorchPipe (PTP) is a component-oriented framework for rapid prototyping and training of computational pipelines combining vision and la…
☆226Updated 6 years ago
IndicoDataSolutions / Enso
Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods
☆96Updated 4 years ago
fastai / fastgpu
A queue service for quickly developing scripts that use all your GPUs efficiently
☆88Updated 3 years ago