IDSIA / modern-srwm
Official repository for the papers "A Modern Self-Referential Weight Matrix That Learns to Modify Itself" (ICML 2022 & NeurIPS 2021 Deep RL Workshop) and "Accelerating Neural Self-Improvement via Bootstrapping" (ICLR 2023 Workshop)
☆172 · Updated 5 months ago
Alternatives and similar repositories for modern-srwm
Users who are interested in modern-srwm are comparing it to the libraries listed below.
- Meta-learning inductive biases in the form of useful conserved quantities. ☆38 · Updated 3 years ago
- Easy Hypernetworks in Pytorch and Jax ☆106 · Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆33 · Updated 3 years ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks" ☆59 · Updated 3 years ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally … ☆77 · Updated 2 years ago
- Language-annotated Abstraction and Reasoning Corpus ☆97 · Updated 2 years ago
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023) ☆97 · Updated 11 months ago
- Gaussian-Bernoulli Restricted Boltzmann Machines ☆105 · Updated 2 years ago
- ☆41 · Updated 3 years ago
- Code for the paper "Predictive Coding Approximates Backprop along Arbitrary Computation Graphs" ☆163 · Updated 5 years ago
- Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in All… ☆172 · Updated 2 years ago
- The Energy Transformer block, in JAX ☆62 · Updated last year
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆81 · Updated 4 years ago
- Hierarchical Associative Memory User Experience ☆104 · Updated 2 weeks ago
- ☆192 · Updated 5 months ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes ☆242 · Updated 2 years ago
- Official repository for the paper "Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules" (… ☆23 · Updated 5 months ago
- Neural Turing Machines in pytorch ☆48 · Updated 3 years ago
- Automatic gradient descent ☆215 · Updated 2 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021) ☆50 · Updated 5 months ago
- Neural Networks and the Chomsky Hierarchy ☆211 · Updated last year
- ☆56 · Updated 2 years ago
- Differentiable Algorithms and Algorithmic Supervision. ☆116 · Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework) ☆189 · Updated 3 years ago
- Sequence Modeling with Structured State Spaces ☆66 · Updated 3 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆32 · Updated 2 years ago
- ☆53 · Updated last year
- Stochastic Automatic Differentiation library for PyTorch. ☆208 · Updated last year
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers. ☆110 · Updated 4 years ago
- Implementation of Hierarchical Transformer Memory (HTM) for Pytorch ☆76 · Updated 4 years ago