brilee / python_uct
Demo of UCT (MCTS) in Python / Numpy
☆85Updated 2 years ago
Alternatives and similar repositories for python_uct
Users that are interested in python_uct are comparing it to the libraries listed below
Sorting:
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆118Updated 4 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 6 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 8 months ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆135Updated last year
- ☆92Updated 4 years ago
- PyTorch implementation of Proximal Policy Optimization☆51Updated 7 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆47Updated 4 years ago
- Highly Modular and Scalable Reinforcement Learning☆115Updated 5 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- A reinforcement learning framework☆155Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆132Updated 7 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 4 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆125Updated 5 years ago
- Distributed implementation of popular evolutionary methods☆64Updated 7 years ago
- ☆35Updated 6 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆102Updated 4 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆239Updated 2 years ago
- Actor-critic with experience replay☆252Updated 2 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆56Updated 6 years ago
- Website for Quality-Diversity optimisation algorithms☆43Updated last month
- ☆68Updated 3 years ago
- ☆69Updated 6 years ago
- Bandits Environments for the OpenAI Gym☆89Updated 5 years ago
- Proximal Policy Optimization implementation with TensorFlow☆107Updated 6 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆105Updated 5 years ago
- An implementation of Monte Carlo Tree Search in python☆162Updated 4 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆204Updated 4 years ago