IDSIA / recurrent-fwp

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)

☆50

Alternatives and similar repositories for recurrent-fwp

Users who are interested in recurrent-fwp are comparing it to the libraries listed below.

ischlag / fast-weight-transformers
Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.
☆108 Updated 4 years ago
lucidrains / HTM-pytorch
Implementation of Hierarchical Transformer Memory (HTM) for Pytorch
☆76 Updated 4 years ago
yilundu / ebm_compositionality
[NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models
☆46 Updated 2 years ago
lucidrains / ESBN-pytorch
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
☆25 Updated 4 years ago
lucidrains / ponder-transformer
Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper
☆81 Updated 4 years ago
joelouismarino / variational_rl
Variational Reinforcement Learning
☆16 Updated last year
RobertCsordas / transformer_generalization
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…
☆67 Updated 2 years ago
yilundu / improved_contrastive_divergence
[ICML'21] Improved Contrastive Divergence Training of Energy Based Models
☆66 Updated 3 years ago
Kaixhin / GUDRL
Generalised UDRL
☆37 Updated 3 years ago
lucidrains / isab-pytorch
An implementation of (Induced) Set Attention Block, from the Set Transformers paper
☆65 Updated 2 years ago
dylandoblar / noether-networks
Meta-learning inductive biases in the form of useful conserved quantities.
☆38 Updated 2 years ago
choidami / sst
☆50 Updated 5 years ago
RedRyan111 / GLOM
An implementation of 2021 paper by Geoffrey Hinton: "How to represent part-whole hierarchies in a neural network" in Pytorch.
☆57 Updated 4 years ago
mcbal / deep-implicit-attention
Implementation of deep implicit attention in PyTorch
☆65 Updated 4 years ago
toshas / torch-discounted-cumsum
Fast Discounted Cumulative Sums in PyTorch
☆96 Updated 4 years ago
PAL-ML / PEARL_v1
☆30 Updated 3 years ago
sarthmit / Compositional-Attention
Code to reproduce the results for Compositional Attention
☆59 Updated 3 years ago
ml-jku / helm
☆57 Updated last year
ssnl / poisson_quasimetric_embedding
Open source code for paper "On the Learning and Learnability of Quasimetrics".
☆32 Updated 2 years ago
AllanYangZhou / metalearning-symmetries
Experiments for Meta-Learning Symmetries by Reparameterization
☆57 Updated 4 years ago
prajjwal1 / rl_paradigm
☆17 Updated last year
nec-research / tf-imle
Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation
☆67 Updated 3 years ago
wouterkool / estimating-gradients-without-replacement
Estimating Gradients for Discrete Random Variables by Sampling without Replacement
☆40 Updated 5 years ago
shwinshaker / LipGrow
An adaptive training algorithm for residual network
☆17 Updated 5 years ago
srush / mamba-scans
Blog post
☆17 Updated last year
lucidrains / mlp-gpt-jax
A GPT, made only of MLPs, in Jax
☆58 Updated 4 years ago
thaihungle / SAM
Self-attentive Associative Memory & SAM-based Two-Memory Model
☆59 Updated 3 years ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆50 Updated 3 years ago
google-deepmind / affordances_option_models
☆23 Updated 4 years ago
ssnl / PyTorch-Reparam-Module
Reparameterize your PyTorch modules
☆71 Updated 4 years ago