wuga214 / PAPER_NIPS17_ScalablePlanning_Tensorflow
Tensorflow is not only an well designed deep learning toolbox, but also a standard symbolic programming framework. In this repository, we show how to use tensorflow to do classical planning task on deterministic, continous action, continous space problems.
☆12Updated 6 years ago
Alternatives and similar repositories for PAPER_NIPS17_ScalablePlanning_Tensorflow:
Users that are interested in PAPER_NIPS17_ScalablePlanning_Tensorflow are comparing it to the libraries listed below
- PyTorch implementation of various reinforcement learning algorithms☆18Updated 7 years ago
- Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]☆26Updated 3 years ago
- Safe learning of regions of attraction in uncertain, nonlinear systems with Gaussian processes☆38Updated 5 years ago
- The code of paper "Nonlinear Hybrid Planning with Deep Net Learned Transition Models and Mixed-Integer Linear Programming." published on …☆10Updated 6 years ago
- Solving POMDPs using exact and approximate methods☆13Updated 7 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 5 years ago
- ☆53Updated 7 years ago
- Scalable MCTS for team scenarios☆16Updated 9 months ago
- ☆27Updated 5 years ago
- This repository contains the code used in the paper Evaluating the Performance of Reinformcent Learning Algorithms☆27Updated 3 years ago
- Implementation of point-based value iteration (for POMDPs)☆12Updated 5 years ago
- The PO-UCT algorithm (aka POMCP) implemented in Julia☆37Updated last week
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆27Updated 5 years ago
- ☆36Updated 8 years ago
- Reproducing Policy Distillation (DeepMind paper ICLR 2016)☆22Updated 5 years ago
- ☆76Updated 2 years ago
- Code the AAAI 2019 paper "Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization"☆31Updated 4 years ago
- ☆11Updated 5 years ago
- Enforcing robust control guarantees within neural network policies☆54Updated 3 years ago
- Reinforcement Learning with Convex Constraints☆14Updated 2 years ago
- Source for Action Schema Networks paper (AAAI'18)☆31Updated last year
- Code for experimenting with state and action abstractions in reinforcement learning.☆31Updated 4 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 6 years ago
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…☆32Updated 4 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Updated 5 years ago
- Generate taylored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time …☆15Updated 7 years ago
- ☆11Updated 2 years ago