rhololkeolke / lspi-python
Least Squares Policy Iteration (LSPI) in Python
☆11 Updated 10 years ago
Alternatives and similar repositories for lspi-python
Users who are interested in lspi-python compare it to the libraries listed below.
- ☆36 Updated 2 years ago
- Model-based reinforcement learning in TensorFlow ☆56 Updated 3 years ago
- Working directory for dynamics learning for experimental robots. ☆57 Updated 4 years ago
- Safe reinforcement learning with stability guarantees ☆232 Updated 3 years ago
- This repository contains the code for RL for POMDPs through learning an Approximate Information State. ☆21 Updated 3 years ago
- Factored model-based Bayesian Reinforcement Learning framework ☆21 Updated 2 years ago
- ☆42 Updated 2 years ago
- Safe exploration in Markov Decision Processes ☆37 Updated 7 years ago
- Least-Squares Policy Iteration ☆8 Updated 2 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations ☆49 Updated 3 years ago
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces. ☆56 Updated last month
- ☆73 Updated 4 years ago
- Safe Reinforcement Learning algorithms ☆74 Updated 2 years ago
- Safe Exploration with MPC and Gaussian process models ☆90 Updated 4 years ago
- Implementation of robust adaptive control methods for the linear quadratic regulator ☆38 Updated 3 years ago
- A library for mean-field games. ☆53 Updated this week
- On the model-based stochastic value gradient for continuous reinforcement learning ☆55 Updated last year
- Implementations of SAILR, PDO, and CSC ☆32 Updated 10 months ago
- Source code for the examples accompanying the paper "Learning convex optimization control policies." ☆84 Updated 2 years ago
- Google AI Princeton control framework ☆38 Updated 4 years ago
- OpenAI Gym environment for Platform ☆20 Updated 6 years ago
- Enforcing robust control guarantees within neural network policies ☆53 Updated 4 years ago
- Safe Reinforcement Learning in Constrained Markov Decision Processes ☆60 Updated 4 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆86 Updated 3 years ago
- Code for the paper Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization ☆69 Updated 5 years ago
- A toolbox for trajectory optimization of dynamical systems ☆53 Updated 2 years ago
- Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms ☆43 Updated last year
- Gym-like extensions for POMDP ☆57 Updated 4 years ago
- The Multiagent Decision Process (MADP) Toolbox - planning and learning in multiagent systems. ☆80 Updated 4 years ago
- Gradient descent algorithms for LQG control ☆14 Updated 3 years ago