MichaelTMatthews / Craftax
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
☆294Updated last month
Alternatives and similar repositories for Craftax:
Users that are interested in Craftax are comparing it to the libraries listed below
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL☆300Updated this week
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆228Updated this week
- ☆191Updated 3 months ago
- Accelerated minigrid environments with JAX☆132Updated 7 months ago
- ☆73Updated 4 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆149Updated this week
- Multi-Agent Reinforcement Learning with JAX☆541Updated 2 weeks ago
- ☆74Updated this week
- Efficient baselines for autocurricula in JAX.☆186Updated 6 months ago
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms☆423Updated 2 weeks ago
- Clean single-file implementation of offline RL algorithms in JAX☆137Updated 2 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 7 months ago
- RL Environments in JAX 🌍☆722Updated 8 months ago
- Goal-Conditioned Reinforcement Learning with JAX☆131Updated this week
- ♟️ Vectorized RL game environments in JAX☆456Updated 2 weeks ago
- Benchmarking the Spectrum of Agent Capabilities☆423Updated last year
- ☆298Updated 3 months ago
- ☆217Updated 4 months ago
- Evaluating long-term memory of reinforcement learning algorithms☆141Updated last year
- Really Fast End-to-End Jax RL Implementations☆841Updated 6 months ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆130Updated 7 months ago
- Accelerated Quality-Diversity☆280Updated this week
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆99Updated last year
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆524Updated 4 months ago
- An Open-Ended Agentic Simulator☆45Updated 7 months ago
- fast + parallel AlphaZero in JAX☆94Updated 3 months ago
- Partially Observable Process Gym☆183Updated 8 months ago
- Modular framework for Reinforcement Learning in python☆172Updated 2 years ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆57Updated last year