NicolasZucchet / Online-learning-LR-dependenciesLinks
Implementation of the "Online learning of long-range dependencies" paper, NeurIPS 2023
☆20Updated last year
Alternatives and similar repositories for Online-learning-LR-dependencies
Users that are interested in Online-learning-LR-dependencies are comparing it to the libraries listed below
Sorting:
- A collection of meta-learning algorithms in Jax☆23Updated 3 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆110Updated 2 years ago
- Accelerated replay buffers in JAX☆45Updated 3 years ago
- An implementation of MuZero in JAX.☆57Updated 3 years ago
- Implementation of PSGD optimizer in JAX☆35Updated 11 months ago
- Dreamer on JAX☆16Updated 3 years ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)☆61Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆37Updated 2 years ago
- ☆19Updated 3 years ago
- Minimal but scalable implementation of large language models in JAX☆35Updated 2 weeks ago
- Python implementation of the methods in Meulemans et al. 2020 - A Theoretical Framework For Target Propagation☆32Updated last year
- Minimizing Control for Credit Assignment with Strong Feedback☆14Updated last year
- ☆22Updated 8 months ago
- General Modules for JAX☆71Updated 3 months ago
- Flax Implementation of DreamerV3 on Crafter☆18Updated 2 weeks ago
- Scaling scaling laws with board games.☆54Updated 2 years ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆31Updated 2 years ago
- Accelerated minigrid environments with JAX☆153Updated last month
- Comparison between GFlowNets & Maximum Entropy RL☆19Updated last year
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆60Updated 2 years ago
- ☆88Updated 3 months ago
- An Open-Ended Agentic Simulator☆56Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆118Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆50Updated last year
- Docker containers of baseline agents for the Crafter environment☆30Updated 3 years ago
- Jax/Flax rewrite of Karpathy's nanoGPT☆62Updated 2 years ago
- ☆46Updated last year
- ☆53Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Updated 3 years ago
- Simple tools to mix and match PyTorch and Jax - Get the best of both worlds!☆36Updated last month