brilee / python_uct
Demo of UCT (MCTS) in Python / Numpy
☆84Updated 2 years ago
Alternatives and similar repositories for python_uct:
Users that are interested in python_uct are comparing it to the libraries listed below
- An implementation of Monte Carlo Tree Search in python☆163Updated 4 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies☆120Updated 8 months ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 4 months ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 5 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆147Updated last year
- Board game AI implementations using Monte Carlo Tree Search☆183Updated 4 years ago
- Highly Modular and Scalable Reinforcement Learning☆114Updated 5 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 5 years ago
- ☆66Updated 3 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆151Updated last year
- A high-performance Atari A3C agent in 180 lines of PyTorch☆171Updated 3 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆59Updated 6 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆114Updated 3 years ago
- An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros.☆70Updated 3 years ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- Building Agents with Imagination: pytorch step-by-step implementation☆206Updated 5 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆100Updated 4 years ago
- Fictitious Self-play & Reinforcement Learning☆19Updated 6 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 4 years ago
- C51-DDQN in Keras☆125Updated 7 years ago
- Bandits Environments for the OpenAI Gym☆90Updated 5 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆70Updated 7 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆47Updated 4 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆37Updated 4 years ago
- A reinforcement learning framework☆154Updated 6 years ago
- Deep Q Learning via Pytorch☆86Updated 7 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago