RPegoud / jym
JAX implementation of RL algorithms and vectorized environments
☆38Updated last year
Alternatives and similar repositories for jym:
Users that are interested in jym are comparing it to the libraries listed below
- Simple JAX Graphics Library.☆29Updated 2 months ago
- ☆18Updated this week
- Highly scalable 2D JAX physics engine.☆44Updated this week
- Baselines for gymnax 🤖☆61Updated last year
- Skeleton for scalable and flexible Jax RL implementations☆69Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆19Updated 2 months ago
- ☆42Updated 6 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆69Updated 5 months ago
- Goal-Conditioned Reinforcement Learning with JAX☆116Updated last week
- ☆67Updated 5 months ago
- ☆72Updated 2 months ago
- Clean single-file implementation of offline RL algorithms in JAX☆127Updated last month
- ☆178Updated last month
- An Open-Ended Agentic Simulator☆36Updated 5 months ago
- Benchmarking RL generalization in an interpretable way.☆138Updated 11 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆135Updated 2 months ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆47Updated last year
- ☆46Updated 2 years ago
- General Modules for JAX☆62Updated 6 months ago
- Conservative Q learning in Jax☆52Updated last year
- JAX implementations of various deep reinforcement learning algorithms.☆21Updated 3 years ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.☆12Updated 2 years ago
- Various reinforcement learning algorithms written in Jax + Flax☆23Updated last year
- Corax: Core RL in JAX☆36Updated 11 months ago
- ☆68Updated 3 months ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆220Updated last week
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆64Updated 7 months ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆54Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 3 months ago