mttga / purejaxql
Simple single-file baselines for Q-Learning in a pure-GPU setting
☆172Updated 3 months ago
Alternatives and similar repositories for purejaxql
Users that are interested in purejaxql are comparing it to the libraries listed below
- ☆81Updated 3 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆80Updated last month
- ☆80Updated 7 months ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!☆226Updated 3 weeks ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆104Updated 2 months ago
- Accelerated minigrid environments with JAX☆139Updated 2 weeks ago
- Clean single-file implementation of offline RL algorithms in JAX☆146Updated 6 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆113Updated 10 months ago
- Benchmarking RL generalization in an interpretable way.☆157Updated last week
- Partially Observable Process Gym☆193Updated 2 weeks ago
- Goal-Conditioned Reinforcement Learning with JAX☆169Updated last month
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆238Updated 2 months ago
- Baselines for gymnax 🤖☆67Updated 2 years ago
- A tool for aggregating and plotting MARL experiment data.☆77Updated 5 months ago
- General Modules for JAX☆65Updated 2 months ago
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL☆335Updated last week
- Evaluating long-term memory of reinforcement learning algorithms☆143Updated 2 years ago
- ☆93Updated 4 months ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆51Updated 3 weeks ago
- ☆41Updated 11 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆73Updated 10 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆100Updated 7 months ago
- Deep Hierarchical Planning from Pixels☆103Updated 2 years ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆319Updated last month
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆130Updated 10 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆56Updated 2 years ago
- Efficient baselines for autocurricula in JAX.☆190Updated 10 months ago
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers☆52Updated 2 years ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆128Updated 3 weeks ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆57Updated last year