mttga / purejaxql
Simple single-file baselines for Q-Learning in a pure-GPU setting
☆142 Updated this week
Alternatives and similar repositories for purejaxql:
Users that are interested in purejaxql are comparing it to the libraries listed below
- ☆74 Updated 6 months ago
- ☆191 Updated 3 months ago
- ☆73 Updated 4 months ago
- Goal-Conditioned Reinforcement Learning with JAX ☆127 Updated this week
- ☆41 Updated 8 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆109 Updated 6 months ago
- Clean single-file implementation of offline RL algorithms in JAX ☆135 Updated 2 months ago
- Baselines for gymnax 🤖 ☆66 Updated last year
- General Modules for JAX ☆64 Updated 2 weeks ago
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆69 Updated 7 months ago
- Accelerated minigrid environments with JAX ☆132 Updated 7 months ago
- Evaluating long-term memory of reinforcement learning algorithms ☆141 Updated last year
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX ☆57 Updated last year
- a simple and scalable agent for training adaptive policies with sequence-based RL ☆114 Updated 3 weeks ago
- ☆217 Updated 4 months ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX ☆227 Updated 3 weeks ago
- Efficient baselines for autocurricula in JAX. ☆185 Updated 6 months ago
- ☆76 Updated 3 weeks ago
- An Open-Ended Agentic Simulator ☆45 Updated 7 months ago
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL ☆293 Updated this week
- Challenging Memory-based Deep Reinforcement Learning Agents ☆93 Updated 4 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆21 Updated 3 months ago
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers ☆50 Updated 2 years ago
- ☆47 Updated 2 years ago
- Benchmarking RL generalization in an interpretable way. ☆148 Updated last week
- Partially Observable Process Gym ☆182 Updated 8 months ago
- ☆18 Updated last month
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design. ☆130 Updated 6 months ago