rhololkeolke / lspi-python
Least Squares Policy Iteration (LSPI) in Python
☆9Updated 9 years ago
Related projects: ⓘ
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces.☆52Updated 3 months ago
- Source code for the examples accompanying the paper "Learning convex optimization control policies."☆80Updated last year
- The PO-UCT algorithm (aka POMCP) implemented in Julia☆35Updated 3 months ago
- Python interface for ECOS☆53Updated 3 months ago
- ☆35Updated last year
- The Machine Learning Optimizer☆97Updated last year
- A gallery of POMDPs.jl problems☆48Updated last week
- Safe Exploration with MPC and Gaussian process models☆87Updated 4 years ago
- Perform Model Checking and POMDP Planning from LTL specifications using POMDPs.jl☆14Updated last month
- Differentiation through cone programs☆89Updated 2 weeks ago
- Monte Carlo Tree Search for Markov decision processes using the POMDPs.jl framework☆73Updated 3 months ago
- Enforcing robust control guarantees within neural network policies☆52Updated 3 years ago
- Safe exploration in Markov Decision Processes☆38Updated 6 years ago
- ☆39Updated last year
- The Multiagent Decision Process (MADP) Toolbox - planning and learning in multiagent systems.☆76Updated 3 years ago
- Safe reinforcement learning with stability guarantees☆222Updated 2 years ago
- AA120Q Course Materials☆28Updated 3 years ago
- Safe Reinforcement Learning algorithms☆69Updated 2 years ago
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆18Updated 3 years ago
- Implementation of a differetiable discrete-time algebraic Riccati equation (DARE) solver in PyTorch.☆11Updated last year
- Safe learning of regions of attraction in uncertain, nonlinear systems with Gaussian processes☆37Updated 4 years ago
- Google AI Princeton control framework☆38Updated 3 years ago
- Non-linear policy graph improvement - planning for Dec-POMDPs☆16Updated 3 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆47Updated 2 years ago
- Reinforcement learning with Deterministic Policy Gradient methods☆9Updated 3 years ago
- Safe Bayesian Optimization☆140Updated last year
- Belief-state planning for POMDPs using learned approximations☆20Updated 3 months ago
- ☆73Updated 2 years ago
- Companion code to "Learning Stable Deep Dynamics Models" (Manek and Kolter, 2019)☆32Updated 4 years ago
- A library for mean-field games.☆43Updated last year