IDSIA / recurrent-fwpLinks
Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)
☆50Updated 4 months ago
Alternatives and similar repositories for recurrent-fwp
Users that are interested in recurrent-fwp are comparing it to the libraries listed below
Sorting:
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆105Updated 4 years ago
- Implementation of Hierarchical Transformer Memory (HTM) for Pytorch☆75Updated 4 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆25Updated 4 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆81Updated 3 years ago
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models☆46Updated 2 years ago
- An implementation of 2021 paper by Geoffrey Hinton: "How to represent part-whole hierarchies in a neural network" in Pytorch.☆57Updated 4 years ago
- Generalised UDRL☆37Updated 3 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 2 years ago
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models☆66Updated 3 years ago
- An adaptive training algorithm for residual network☆17Updated 5 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 2 years ago
- ☆50Updated 5 years ago
- Variational Reinforcement Learning☆16Updated last year
- ☆56Updated 11 months ago
- ☆30Updated 3 years ago
- Implementation of deep implicit attention in PyTorch☆65Updated 4 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization☆57Updated 4 years ago
- Code for "Recurrent Independent Mechanisms"☆118Updated 3 years ago
- Code to reproduce the results for Compositional Attention☆59Updated 2 years ago
- ☆44Updated 5 years ago
- ☆17Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…☆46Updated 5 years ago
- Fast Discounted Cumulative Sums in PyTorch☆96Updated 4 years ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube☆47Updated 6 years ago
- ☆23Updated 3 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Updated 7 years ago
- Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation☆67Updated 3 years ago
- ☆45Updated 5 years ago