wuga214 / PAPER_NIPS17_ScalablePlanning_TensorflowLinks
Tensorflow is not only an well designed deep learning toolbox, but also a standard symbolic programming framework. In this repository, we show how to use tensorflow to do classical planning task on deterministic, continous action, continous space problems.
☆12Updated 6 years ago
Alternatives and similar repositories for PAPER_NIPS17_ScalablePlanning_Tensorflow
Users that are interested in PAPER_NIPS17_ScalablePlanning_Tensorflow are comparing it to the libraries listed below
Sorting:
- The code of paper "Nonlinear Hybrid Planning with Deep Net Learned Transition Models and Mixed-Integer Linear Programming." published on …☆10Updated 7 years ago
- Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]☆26Updated 3 years ago
- Safe learning of regions of attraction in uncertain, nonlinear systems with Gaussian processes☆38Updated 5 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆56Updated 6 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- ☆36Updated 2 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- Implementation of the paper "Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning".☆27Updated 5 years ago
- Implementation of point-based value iteration (for POMDPs)☆12Updated 5 years ago
- Safe Reinforcement Learning algorithms☆74Updated 2 years ago
- Hybrid Deep MILP Planner☆14Updated 2 years ago
- The three algorithms used to solve Bayesian Stackelberg Games have been implemented here.☆28Updated 6 years ago
- Chance constraints in CVXPY☆18Updated 6 years ago
- Model-based reinforcement learning in TensorFlow☆56Updated 3 years ago
- Source code for the Paper: CombOptNet: Fit the Right NP-Hard Problem by Learning Integer Programming Constraints}☆73Updated 3 years ago
- Solving POMDPs using exact and approximate methods☆13Updated 7 years ago
- Enforcing robust control guarantees within neural network policies☆53Updated 4 years ago
- Code for training policies based on paper Coordinated Multi-Agent Imitation Learning☆26Updated 7 years ago
- ☆73Updated 4 years ago
- Multi-Objective Reinforcement Learning components built on top of RL glue components☆29Updated 2 years ago
- Generate taylored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time …☆15Updated 7 years ago
- ☆54Updated 7 years ago
- Code the AAAI 2019 paper "Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization"☆33Updated 4 years ago
- Source for Action Schema Networks paper (AAAI'18)☆31Updated 2 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆33Updated 7 years ago
- A library to benchmark reinforcement learning algorithms☆21Updated 7 years ago
- ☆27Updated 6 years ago
- The PO-UCT algorithm (aka POMCP) implemented in Julia☆37Updated 2 months ago
- Companion code to "Learning Stable Deep Dynamics Models" (Manek and Kolter, 2019)☆33Updated 5 years ago