koulanurag / muzero-pytorch
PyTorch implementation of MuZero
☆347 Updated last year
Alternatives and similar repositories for muzero-pytorch:
Users who are interested in muzero-pytorch are comparing it to the libraries listed below:
- A structured implementation of MuZero ☆207 Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆155 Updated 3 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆879 Updated last year
- Dream to Control: Learning Behaviors by Latent Imagination ☆524 Updated 3 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games ☆540 Updated last year
- Code for Go-Explore: a New Approach for Hard-Exploration Problems ☆561 Updated 2 years ago
- A simple implementation of MuZero algorithm for connect4 game ☆97 Updated 4 years ago
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (… ☆463 Updated 9 months ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆167 Updated 6 months ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆193 Updated 2 years ago
- ☆293 Updated last month
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch. ☆293 Updated last year
- A PyTorch Platform for Distributed RL ☆746 Updated 3 years ago
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code ☆666 Updated 8 months ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning ☆578 Updated 4 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm ☆214 Updated last year
- Deep Planning Network: Control from pixels by latent planning with learned dynamics ☆369 Updated 3 years ago
- Dream to Control: Learning Behaviors by Latent Imagination ☆623 Updated 4 years ago
- Tools for accelerating safe exploration research. ☆516 Updated last year
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆480 Updated 2 years ago
- Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms ☆287 Updated last year
- Prioritized Experience Replay (PER) implementation in PyTorch ☆312 Updated 4 years ago
- An environment of the board game Go using OpenAI's Gym API ☆168 Updated 2 years ago
- A Python interface for reinforcement learning environments ☆355 Updated 2 years ago
- Code for the paper "Phasic Policy Gradient" ☆259 Updated last year
- A collection of multi agent environments based on OpenAI gym. ☆584 Updated 6 months ago
- Random Network Distillation pytorch ☆243 Updated 5 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments ☆304 Updated 3 years ago
- A basic 2D maze environment where an agent starts from the top left corner and tries to find its way to the bottom left corner. ☆357 Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC) ☆523 Updated 3 years ago