YuriCat / MuZeroJupyterExample
☆65Updated 3 years ago
Related projects
Alternatives and complementary repositories for MuZeroJupyterExample
- A simple implementation of the MuZero algorithm for the Connect 4 game☆95Updated 4 years ago
- A structured implementation of MuZero☆206Updated 2 years ago
- A Python implementation of tabular MuZero for educational purposes☆21Updated 4 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆199Updated 5 years ago
- ☆91Updated 3 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆199Updated 4 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 3 years ago
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository☆45Updated 4 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Scalable Implementation of Neural Fictitious Self-Play☆73Updated 5 years ago
- Keeping track of RL experiments☆159Updated last year
- Implementation code for Agent57 (reinforcement learning), written for a Qiita post.☆45Updated last year
- Pytorch Implementation of MuZero☆343Updated last year
- An environment of the board game Go using OpenAI's Gym API☆168Updated 2 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆201Updated 4 years ago
- Pytorch implementation of distributed deep reinforcement learning☆74Updated 2 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆20Updated last year
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆112Updated 3 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies☆118Updated 6 months ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 2 months ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆157Updated 2 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆232Updated 2 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆191Updated 2 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆189Updated 5 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆76Updated 4 years ago
- Demo of UCT (MCTS) in Python / Numpy☆83Updated last year
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆258Updated last month
- ICML 2018 Self-Imitation Learning☆276Updated 4 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆68Updated last year
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆96Updated 2 years ago