tinker495 / jax-baseline
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselines.
☆40Updated last week
Related projects ⓘ
Alternatives and complementary repositories for jax-baseline
- Accelerated replay buffers in JAX☆40Updated 2 years ago
- ☆63Updated 3 months ago
- Skeleton for scalable and flexible Jax RL implementations☆63Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆57Updated 5 months ago
- ☆34Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆59Updated last year
- Deep Reinforcement Learning Framework done with PyTorch☆30Updated this week
- An implementation of MuZero in JAX.☆53Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆52Updated 7 months ago
- Vectorization techniques for fast population-based training.☆54Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆55Updated 10 months ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆46Updated last year
- Benchmarked implementations of Offline RL Algorithms.☆65Updated 6 months ago
- Various reinforcement learning algorithms written in Jax + Flax☆23Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆79Updated last year
- General Modules for JAX☆59Updated 3 months ago
- Baselines for gymnax 🤖☆60Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting☆104Updated 3 months ago
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- POPGym Library in JAX☆11Updated 7 months ago
- ☆38Updated last year
- An Open-Ended Agentic Simulator☆28Updated 3 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆39Updated 2 years ago
- ☆42Updated last year
- Docker containers of baseline agents for the Crafter environment☆28Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 3 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- Corax: Core RL in JAX☆35Updated 9 months ago
- Scalable Opponent Shaping Experiments in JAX☆21Updated 7 months ago