lowrollr / turbozero_torchLinks

fast + parallel AlphaZero in PyTorch

☆12

Alternatives and similar repositories for turbozero_torch

Users that are interested in turbozero_torch are comparing it to the libraries listed below

Sorting:

lowrollr / turbozero
fast + parallel AlphaZero in JAX
☆97Updated 6 months ago
MichaelTMatthews / Craftax
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
☆322Updated last week
Reytuag / transformerXL_PPO_JAX
☆81Updated 8 months ago
instadeepai / flashbax
⚡ Flashbax: Accelerated Replay Buffers in JAX
☆239Updated 3 months ago
epignatelli / navix
Accelerated minigrid environments with JAX
☆141Updated last month
EdanToledo / Stoix
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
☆343Updated this week
rlglab / minizero
MiniZero: An AlphaZero and MuZero Training Framework
☆94Updated 4 months ago
sotetsuk / pgx
♟️ Vectorized RL game environments in JAX
☆498Updated 4 months ago
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆173Updated 3 months ago
lowrollr / mctx-az
Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
☆20Updated 2 months ago
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆105Updated 3 weeks ago
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆114Updated 10 months ago
instadeepai / sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
☆58Updated last year
facebookresearch / minimax
Efficient baselines for autocurricula in JAX.
☆189Updated 10 months ago
DramaCow / jaxued
☆82Updated 3 months ago
keraJLi / rejax
Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!
☆230Updated last month
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆107Updated last year
bwfbowen / muax
A project that provides help for using DeepMind's mctx on gym-style environments.
☆60Updated 8 months ago
NTT123 / a0-jax
AlphaZero in JAX
☆78Updated last year
Carbon225 / mctx-classic
Classic MCTS example with mctx
☆18Updated 2 years ago
danijar / ninjax
General Modules for JAX
☆66Updated 3 months ago
kevaday / alphazero-general
A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.
☆78Updated 7 months ago
strakam / generals-bots
Develop your agent for generals.io!
☆55Updated 2 weeks ago
mohmdelsayed / streaming-drl
Deep reinforcement learning without experience replay, target networks, or batch updates.
☆256Updated 4 months ago
jurgisp / memory-maze
Evaluating long-term memory of reinforcement learning algorithms
☆145Updated 2 years ago
ollebompa / PGA-MAP-Elites
Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d…
☆57Updated 3 years ago
danijar / crafter
Benchmarking the Spectrum of Agent Capabilities
☆454Updated last year
nissymori / JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
☆150Updated 6 months ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆101Updated 8 months ago
MichaelTMatthews / Craftax_Baselines
☆19Updated 2 months ago