thiagopbueno / tf-mdp
Probabilistic planning in continuous state-action MDPs in TensorFlow.
☆12Updated 2 years ago
Alternatives and similar repositories for tf-mdp:
Users that are interested in tf-mdp are comparing it to the libraries listed below
- A toolkit for working with RDDL domains in Python3.☆17Updated 4 years ago
- Planning through backpropagation using TensorFlow.☆16Updated 4 years ago
- ☆27Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- A curated list of online resources for probabilistic planning: papers, software and research groups around the world!☆58Updated 6 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 5 years ago
- Automatically exported from code.google.com/p/rddlsim☆52Updated 11 months ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆27Updated 5 years ago
- ICRL 2020☆19Updated 5 years ago
- Code that translates grammar into PDDL, runs a planner to produce multiple plans, translates plans into trainable lale pipelines and trai…☆17Updated last year
- Planning using Reinforcement Learning☆8Updated 6 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Code for Automatic Curriculum Learning through Value Disagreement☆30Updated 4 years ago
- A Library of MDP algorithms for Artificial Intelligence☆18Updated 5 years ago
- Solving POMDPs using exact and approximate methods☆13Updated 7 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- ☆18Updated 4 years ago
- A toolkit for auto-generation of OpenAI Gym environments from RDDL description files.☆76Updated this week
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Non-linear policy graph improvement - planning for Dec-POMDPs☆16Updated 3 years ago
- Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China.☆20Updated last year
- Comp 781 Project☆8Updated 6 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆36Updated 2 years ago
- Code for generating options for planning and reinforcement learning☆11Updated 4 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 5 years ago
- This repository contains the code used in the paper Evaluating the Performance of Reinformcent Learning Algorithms☆27Updated 3 years ago
- Implementation of PDFAs and PDFA learning algorithm.☆11Updated 4 years ago