edpowley / mcts.ai
☆18Updated 5 years ago
Related projects:
- Implementation of the AlphaGo Zero algorithm for the game of tic-tac-toe☆16Updated 6 years ago
- A Python client library for microRTS.☆19Updated 4 years ago
- Demo of UCT (MCTS) in Python / Numpy☆81Updated last year
- ☆57Updated last year
- Monte Carlo Tree Search with UCT with a couple of example games.☆151Updated 3 years ago
- ☆12Updated 3 years ago
- Fictitious Self-play & Reinforcement Learning☆19Updated 6 years ago
- Board game AI implementations using Monte Carlo Tree Search☆181Updated 4 years ago
- Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games☆13Updated 6 years ago
- ☆53Updated 7 years ago
- This is the code for "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on YouTube☆63Updated 7 years ago
- Keras implementation of Curiosity-driven Exploration by Self-supervised Prediction☆8Updated 7 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆32Updated 8 years ago
- RainBow, Tensorflow☆49Updated 6 years ago
- An implementation of Monte Carlo Tree Search in Python☆159Updated 3 years ago
- Single Player Monte Carlo Tree Search implementation☆18Updated 4 years ago
- Reinforcement learning in 3D.☆21Updated 7 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆70Updated 7 years ago
- Reinforcement learning benchmarking.☆39Updated 5 years ago
- Reinforcement Learning for Super Mario Bros using A3C on GPU☆36Updated 6 years ago
- ☆63Updated 2 years ago
- Combining deep learning and reinforcement learning.☆81Updated 2 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆83Updated 8 years ago
- Reproducing MuJoCo benchmarks in a modern, commercial game/physics engine (Unity + PhysX).☆50Updated 2 months ago
- DDPG on OpenAI Gym Pendulum☆19Updated 8 years ago
- ☆30Updated 4 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 6 years ago
- ☆50Updated this week
- Reinforcement learning algorithms to play Poker☆15Updated 2 years ago