acguez / bamcpLinks

Bayes-Adaptive Monte-Carlo Planning algorithm

☆17

Alternatives and similar repositories for bamcp

Users that are interested in bamcp are comparing it to the libraries listed below

Sorting:

Santara / stochastic_value_gradient
Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]
☆26Updated 3 years ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49Updated 3 years ago
zuoxingdong / DeepPILCO
☆54Updated 7 years ago
Bellman-devs / bellman
Model-based reinforcement learning in TensorFlow
☆56Updated 3 years ago
info-structures / ais
This repository contains the code for RL for POMDPs through learning an Approximate Information State.
☆21Updated 3 years ago
RonanFR / UCRL
☆27Updated 6 years ago
WilsonWangTHU / POPLIN
☆99Updated 2 years ago
sebascuri / hucrl
☆30Updated last year
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 5 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆56Updated 6 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆33Updated 5 years ago
ericjang / e2c
TensorFlow impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
☆65Updated 9 years ago
AlgTUDelft / SolvePOMDP
Solving POMDPs using exact and approximate methods
☆14Updated 7 years ago
dtak / hip-mdp-public
Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning
☆32Updated 7 years ago
siemens / industrialbenchmark
Industrial Benchmark
☆131Updated 2 years ago
MinRegret / TigerControl
Google AI Princeton control framework
☆38Updated 4 years ago
MADPToolbox / MADP
The Multiagent Decision Process (MADP) Toolbox - planning and learning in multiagent systems.
☆82Updated 4 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Updated 6 years ago
sparisi / td-reg
TD-Regularized Actor-Critic Methods
☆36Updated 5 years ago
young-j-park / 18-NeurIPS-APIAE
☆21Updated 6 years ago
brain-research / mirage-rl
Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.
☆17Updated 6 years ago
locuslab / stable_dynamics
Companion code to "Learning Stable Deep Dynamics Models" (Manek and Kolter, 2019)
☆33Updated 5 years ago
mcgillmrl / prob_mbrl
A library of probabilistic model based RL algorithms in pytorch
☆107Updated 4 years ago
supratikp / HOOF
Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583
☆19Updated 5 years ago
nnaisense / MAGE
Learning Action-Value Gradients in Model-based Policy Optimization
☆31Updated 3 years ago
samkatt / fba-pomdp
Factored model-based Bayesian Reinforcement Learning framework
☆21Updated 2 years ago
StanfordASL / BaRC
Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoo…
☆12Updated 7 years ago
befelix / Safe-RL-Benchmark
A library to benchmark reinforcement learning algorithms
☆21Updated 7 years ago
laurimi / npgi
Non-linear policy graph improvement - planning for Dec-POMDPs
☆16Updated 4 years ago
manantomar / DSR
Deep Successor Representation
☆17Updated 7 years ago