martinobdl / MCTSLinks
Implementation of SPW and DPW for Monte Carlo Tree Search in Continuous action/state space
β18Updated last year
Alternatives and similar repositories for MCTS
Users that are interested in MCTS are comparing it to the libraries listed below
Sorting:
- π§Ά Minimal PyTorch Soft Actor Critic (SAC) implementationβ38Updated 3 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICLβ¦β55Updated 4 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Explorationβ68Updated 3 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisationβ14Updated 4 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020β32Updated 3 years ago
- PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020β42Updated 4 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimationβ39Updated 8 months ago
- Bayesian Inverse Reinforcement Learning with simple environmentsβ20Updated 3 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithmβ44Updated 6 years ago
- A library of probabilistic model based RL algorithms in pytorchβ107Updated 4 years ago
- β35Updated 5 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimizationβ24Updated 5 years ago
- Symplectic Recurrent Neural Networksβ28Updated 2 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).β93Updated 11 months ago
- Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China.β20Updated last year
- β19Updated 3 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learningβ49Updated 4 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQNβ45Updated 4 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)β35Updated 4 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021β16Updated 4 years ago
- Learning Action-Value Gradients in Model-based Policy Optimizationβ31Updated 3 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variablesβ73Updated 2 years ago
- Pytorch implementation of Soft Actor-Criticβ20Updated 5 years ago
- Reinforcement Learning papers on exploration methods.β19Updated 4 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is tβ¦β44Updated 4 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)β44Updated 2 years ago
- Implementation of Russo and Van Roy work on Information Directed Sampling (2017)β20Updated 6 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)β19Updated 2 years ago
- PyTorch implementation of Probabilistic Network Ensembles on toy problemsβ23Updated 2 years ago
- Pytorch implementation of Randomized Ensembled Double Q-learning (REDQ)β21Updated 4 years ago