saschaschramm / MonteCarloTreeSearch
This project applies Monte Carlo Tree Search (MCTS) to a simple grid world.
☆10 Updated 6 years ago
Related projects
Alternatives and complementary repositories for MonteCarloTreeSearch
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning ☆25 Updated 4 years ago
- PyTorch IMPALA implementation ☆24 Updated 5 years ago
- Random Network Distillation (RND) algorithm in PyTorch ☆48 Updated 5 years ago
- Combining Evolutionary Algorithms and deep RL in various ways ☆99 Updated 4 years ago
- Implementation of Relational Deep Reinforcement Learning ☆25 Updated 4 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020 ☆30 Updated 3 years ago
- General-Purpose Reinforcement Learning ☆17 Updated 3 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆94 Updated 4 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆48 Updated 4 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning ☆22 Updated 4 years ago
- Evolution-based Soft Actor-Critic (ESAC) ☆41 Updated 4 months ago
- Multi-Agent Determinantal Q-Learning ☆42 Updated 2 years ago
- ☆43 Updated last year
- Distributed RL implementation using PyTorch and Ray (ApeX (Ape-X), A3C, Distributed-PPO (DPPO), IMPALA) ☆26 Updated 2 years ago
- General Python implementation of Monte Carlo Tree Search for use with OpenAI Gym environments. ☆35 Updated 4 years ago
- ☆29 Updated 2 years ago
- Proximal Policy Optimization with Beta distribution - uses multi-agent Unity ML Tennis ☆28 Updated 5 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020 ☆50 Updated 4 months ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 Updated 6 years ago
- PyTorch implementation of DreamerV2: MASTERING ATARI WITH DISCRETE WORLD MODELS ☆50 Updated 2 years ago
- On-policy optimization baselines for deep reinforcement learning ☆28 Updated 4 years ago
- Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆52 Updated last year
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆45 Updated last year
- An implementation of GAIL in PyTorch ☆14 Updated 4 years ago
- Reinforcement Learning with Perturbed Reward, AAAI 2020 ☆28 Updated 3 months ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 3 years ago
- Our version of #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning for a class project ☆14 Updated 3 years ago
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020) ☆81 Updated last year
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research. ☆45 Updated 8 months ago
- Soft Actor-Critic with advanced features ☆47 Updated last month