koulanurag / muzero-pytorch
Pytorch Implementation of MuZero
☆335Updated last year
Related projects: ⓘ
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆153Updated 3 years ago
- A structured implementation of MuZero☆203Updated 2 years ago
- A Python interface for reinforcement learning environments☆344Updated last year
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆860Updated 8 months ago
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆450Updated 5 months ago
- An environment of the board game Go using OpenAI's Gym API☆164Updated 2 years ago
- Code for Go-Explore: a New Approach for Hard-Exploration Problems☆554Updated last year
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆191Updated last year
- Code for the paper "Phasic Policy Gradient"☆245Updated last year
- Dream to Control: Learning Behaviors by Latent Imagination☆506Updated 3 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆569Updated 3 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆526Updated last year
- ☆282Updated last year
- A collection of multi agent environments based on OpenAI gym.☆553Updated 2 months ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆158Updated last month
- A simple implementation of MuZero algorithm for connect4 game☆93Updated 4 years ago
- A PyTorch Platform for Distributed RL☆737Updated 3 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆277Updated 8 months ago
- Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms☆283Updated last year
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.☆608Updated last year
- RAD: Reinforcement Learning with Augmented Data☆400Updated 3 years ago
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆644Updated 4 months ago
- Tools for accelerating safe exploration research.☆495Updated last year
- Benchmarking the Spectrum of Agent Capabilities☆373Updated 7 months ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆359Updated 2 years ago
- Random Network Distillation pytorch☆239Updated 5 years ago
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)☆227Updated 2 months ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆302Updated 4 years ago
- Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...☆397Updated 3 years ago
- Code for conservative Q-learning☆393Updated 2 years ago