chengyu2 / learning_alpha_star
Files from the published Alpha Star paper by DeepMind
☆16Updated 5 years ago
Alternatives and similar repositories for learning_alpha_star:
Users that are interested in learning_alpha_star are comparing it to the libraries listed below
- My implementation of the Proximal Policy Optimisation algorithm using Keras as a backend☆88Updated 5 years ago
- Random Network Distillation (RND) algorithm in PyTorch☆49Updated 6 years ago
- PySC2 OpenAI Gym Environments☆48Updated 6 years ago
- Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch☆27Updated 5 years ago
- ☆142Updated 3 months ago
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository☆45Updated 5 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆189Updated 6 years ago
- Meta Reinforcement Learning Experiments☆34Updated 7 years ago
- Keeping track of RL experiments☆162Updated 2 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)☆135Updated 6 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 4 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆259Updated 5 months ago
- World Models with A3C on Carracing-v0 in gym☆33Updated 5 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆83Updated 3 years ago
- State-of-the-art deep RL algorithms for Montezuma's Revenge☆25Updated 6 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆203Updated 4 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- Pytorch implementation of distributed deep reinforcement learning☆75Updated 2 years ago
- Proximal Policy Optimization in PyTorch☆39Updated 7 years ago
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 6 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x☆62Updated 3 years ago
- OpenAI Gym wrapper for ViZDoom environments☆69Updated 3 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆79Updated 6 years ago
- PyTorch implementation of Distributed Prioritized Experience Replay (Ape-X)☆153Updated 5 years ago
- DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM☆84Updated 4 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆137Updated 2 years ago
- A PyTorch implementation of Rainbow DQN agent☆170Updated 6 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆126Updated 5 years ago
- Implementation code for Agent57 (reinforcement learning), written for a Qiita post.☆44Updated last year
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆21Updated last year