chengyu2 / learning_alpha_starLinks
Files from the published Alpha Star paper by DeepMind
☆17Updated 6 years ago
Alternatives and similar repositories for learning_alpha_star
Users that are interested in learning_alpha_star are comparing it to the libraries listed below
Sorting:
- ☆148Updated last year
- Keeping track of RL experiments☆166Updated 3 years ago
- Performances of Reinforcement Learning Agents☆53Updated 6 years ago
- Vectorized interface for reinforcement learning environments☆142Updated 2 years ago
- Pytorch Implementation of MuZero☆352Updated 2 years ago
- A structured implementation of MuZero☆206Updated 3 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)☆138Updated 7 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆260Updated last year
- Multi Agent Reinforcement Learning using MalmÖ☆265Updated 5 years ago
- OpenAI Gym wrapper for ViZDoom enviroments☆70Updated 4 years ago
- Pytorch implementation of distributed deep reinforcement learning☆76Updated 3 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆190Updated 6 years ago
- Random Network Distillation pytorch☆260Updated 6 years ago
- A collection of baselines for the MineRL environment/datasets & the NeurIPS 2021 MineRL competitions☆149Updated 4 years ago
- Actor-critic with experience replay☆256Updated 3 years ago
- Code for Go-Explore: a New Approach for Hard-Exploration Problems☆579Updated 3 years ago
- Some baselines for Pommerman competition☆46Updated 7 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆205Updated 5 years ago
- Qiita投稿用に作成したAgent57(強化学習)の実装コードです。☆45Updated 2 years ago
- Paired Open-Ended Trailblazer (POET) and Enhanced POET☆260Updated 3 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆85Updated 7 years ago
- Multitask Environments for RL☆281Updated 4 years ago
- An environment of the board game Go using OpenAI's Gym API☆177Updated 3 years ago
- Code for the paper "Phasic Policy Gradient"☆267Updated 2 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆374Updated 4 years ago
- A PyTorch Platform for Distributed RL☆752Updated 4 years ago
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆132Updated 8 years ago
- ☆66Updated 4 years ago
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 7 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆207Updated 7 years ago