NicolasZucchet / Online-learning-LR-dependenciesLinks
Implementation of the "Online learning of long-range dependencies" paper, NeurIPS 2023
☆19Updated 9 months ago
Alternatives and similar repositories for Online-learning-LR-dependencies
Users that are interested in Online-learning-LR-dependencies are comparing it to the libraries listed below
Sorting:
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- Accelerated replay buffers in JAX☆43Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX☆35Updated last month
- General Modules for JAX☆67Updated 4 months ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)☆56Updated last month
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆110Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆36Updated last year
- Dreamer on JAX☆16Updated 3 years ago
- Accelerated minigrid environments with JAX☆143Updated 2 weeks ago
- Scaling scaling laws with board games.☆53Updated 2 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Jax/Flax rewrite of Karpathy's nanoGPT☆59Updated 2 years ago
- Simple tools to mix and match PyTorch and Jax - Get the best of both worlds!☆33Updated 3 months ago
- Implementation of PSGD optimizer in JAX☆34Updated 7 months ago
- ☆40Updated last year
- Comparison between GFlowNets & Maximum Entropy RL☆19Updated last year
- JAX implementations of core Deep RL algorithms☆82Updated 3 years ago
- ☆81Updated 9 months ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆58Updated last year
- ☆82Updated 5 months ago
- ☆19Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆114Updated last year
- A simple library for scaling up JAX programs☆143Updated 9 months ago
- ☆22Updated 5 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆58Updated 3 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Updated last year
- An Open-Ended Agentic Simulator☆52Updated last year
- PyTorch Package For Quasimetric Learning☆42Updated 9 months ago
- flexible meta-learning in jax☆14Updated last year
- Baselines for gymnax 🤖☆71Updated 2 years ago