IDSIA / modern-srwmLinks
Official repository for the paper "A Modern Self-Referential Weight Matrix That Learns to Modify Itself" (ICML 2022 & NeurIPS 2021 Deep RL Workshop) and "Accelerating Neural Self-Improvement via Bootstrapping" (ICLR 2023 Workshop)
☆173Updated 2 months ago
Alternatives and similar repositories for modern-srwm
Users that are interested in modern-srwm are comparing it to the libraries listed below
Sorting:
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆59Updated 3 years ago
- Easy Hypernetworks in Pytorch and Jax☆103Updated 2 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- The Abstraction and Reasoning Corpus made into a web game☆90Updated 11 months ago
- Hierarchical Associative Memory User Experience☆103Updated 3 weeks ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆72Updated 2 years ago
- Gaussian-Bernoulli Restricted Boltzmann Machines☆104Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 3 years ago
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆94Updated 8 months ago
- ☆192Updated last month
- ☆39Updated 3 years ago
- Language-annotated Abstraction and Reasoning Corpus☆90Updated 2 years ago
- Neural Networks and the Chomsky Hierarchy☆207Updated last year
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆81Updated 3 years ago
- Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in All…☆172Updated 2 years ago
- ☆54Updated 2 years ago
- A centralized place for deep thinking code and experiments☆85Updated 2 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆116Updated 2 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆179Updated 2 weeks ago
- Neural Turing Machines in pytorch☆48Updated 3 years ago
- Automatic gradient descent☆208Updated 2 years ago
- ☆17Updated 11 months ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆49Updated 2 months ago
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆172Updated 2 years ago
- The Energy Transformer block, in JAX☆59Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆78Updated 3 years ago
- Sequence Modeling with Structured State Spaces☆65Updated 3 years ago
- ☆51Updated last year
- Image augmentation library for Jax☆39Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆85Updated last year