IDSIA / modern-srwm
Official repository for the paper "A Modern Self-Referential Weight Matrix That Learns to Modify Itself" (ICML 2022 & NeurIPS 2021 Deep RL Workshop) and "Accelerating Neural Self-Improvement via Bootstrapping" (ICLR 2023 Workshop)
☆170 · Updated last year
Alternatives and similar repositories for modern-srwm
Users who are interested in modern-srwm are comparing it to the libraries listed below:
- Hierarchical Associative Memory User Experience ☆101 · Updated last year
- Easy Hypernetworks in Pytorch and Jax ☆100 · Updated 2 years ago
- Automatic gradient descent ☆207 · Updated last year
- ☆192 · Updated 3 weeks ago
- The Energy Transformer block, in JAX ☆57 · Updated last year
- Neural Networks and the Chomsky Hierarchy ☆206 · Updated last year
- ☆104 · Updated 3 years ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally … ☆71 · Updated last year
- ☆25 · Updated 2 years ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks" ☆59 · Updated 3 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆80 · Updated 3 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆31 · Updated 3 years ago
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023) ☆92 · Updated 5 months ago
- Gaussian-Bernoulli Restricted Boltzmann Machines ☆104 · Updated 2 years ago
- Neural Turing Machines in pytorch ☆48 · Updated 3 years ago
- The Abstraction and Reasoning Corpus made into a web game ☆89 · Updated 8 months ago
- ☆246 · Updated 7 months ago
- ☆51 · Updated 2 years ago
- Code for the paper "Predictive Coding Approximates Backprop along Arbitrary Computation Graphs" ☆151 · Updated 4 years ago
- Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in All… ☆169 · Updated 2 years ago
- ☆39 · Updated 3 years ago
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 · Updated 2 years ago
- ☆56 · Updated 2 years ago
- ☆49 · Updated last year
- Image augmentation library for Jax ☆39 · Updated last year
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena ☆203 · Updated last year
- Unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆78 · Updated 2 years ago
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling" ☆82 · Updated 2 years ago
- Sequence Modeling with Structured State Spaces ☆63 · Updated 2 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers. ☆105 · Updated 3 years ago