lteacy / mcts-cpp
Monte Carlo Tree Search implemented in C++
☆8Updated 12 years ago
Related projects ⓘ
Alternatives and complementary repositories for mcts-cpp
- Monte Carlo Tree Search (MCTS) ,realize using python☆12Updated 8 years ago
- Implementation is mostly based on Sergey Levine work (http://www.eecs.berkeley.edu/~svlevine/).☆43Updated 9 years ago
- hierarchical Q-learning implementation☆11Updated 9 years ago
- DDPG on OpenAI Gym Pendulum☆19Updated 8 years ago
- ☆33Updated 8 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 8 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆41Updated 8 years ago
- Deep Reinforcement Learning for Multi Agent Soccer☆17Updated 7 years ago
- Python implementation of tabular asynchronous actor critic☆11Updated 8 years ago
- Model-Free Episodic Control☆15Updated 7 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆32Updated 8 years ago
- Single Player Monte Carlo Tree Search implementation☆18Updated 4 years ago
- Pack of Drones: Layered reinforcement learning (Q-learning w/ RNN) for complex "hunt" behaviors☆61Updated 8 years ago
- Reinforcement learning algorithms, worlds and experiments in Python☆10Updated 7 years ago
- reinforcement learning. policy gradient. PCL☆38Updated 7 years ago
- Asynchronous Advantage Actor Critic☆21Updated 8 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 6 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Updated 3 years ago
- C++ library for reinforcement learning☆51Updated last year
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆31Updated 6 years ago
- Implementation of Scheduled Policy Optimization for task-oriented language grouding☆29Updated 6 years ago
- ☆11Updated last year
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- A C++ implementation of the derivative-free optimization algorithm CMA-ES.☆23Updated 11 years ago
- Variation of "Asynchronous Methods for Deep Reinforcement Learning" with multiple processes generating experience for agent (Keras + Thea…☆44Updated 6 years ago
- WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer☆11Updated 7 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆43Updated 6 years ago
- My Udacity Machine Learning Nanodegree capstone project in Reinforcement Learning☆11Updated 6 years ago
- ☆53Updated 7 years ago