hr0nix / dejax
Accelerated replay buffers in JAX
☆41Updated 2 years ago
Alternatives and similar repositories for dejax:
Users that are interested in dejax are comparing it to the libraries listed below
- An implementation of MuZero in JAX.☆54Updated 2 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆47Updated last year
- General Modules for JAX☆62Updated 6 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- Vectorization techniques for fast population-based training.☆55Updated 2 years ago
- Baselines for gymnax 🤖☆61Updated last year
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- An Open-Ended Agentic Simulator☆36Updated 5 months ago
- Simple JAX Graphics Library.☆29Updated 2 months ago
- ☆67Updated 5 months ago
- ☆18Updated this week
- GPT implementation in Flax☆18Updated 3 years ago
- ☆19Updated 7 months ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆54Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆44Updated last week
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.☆12Updated 2 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20Updated 2 years ago
- Conservative Q learning in Jax☆52Updated last year
- Corax: Core RL in JAX☆36Updated 11 months ago
- ☆46Updated 2 years ago
- ☆41Updated last year
- Learning Robust Dynamics Through Variational Sparse Gating☆21Updated 2 years ago
- ☆72Updated 2 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 3 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆54Updated 10 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆69Updated 5 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 5 months ago
- Flax Implementation of DreamerV3 on Crafter☆10Updated 10 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆19Updated 2 months ago