yrlu / non-convex
Self-contained implementation of non-convex optimization algorithms in Python: Steepest Descent, Conjugate Gradient, Newton's Method, Quasi-Newton (BFGS), L-BFGS
☆82 Updated 4 years ago
Alternatives and similar repositories for non-convex
Users who are interested in non-convex are comparing it to the libraries listed below.
- Implementation of Steepest Descent, Newton, Quasi-Newton and Conjugate Gradient for non-linear unconstrained optimization ☆28 Updated 12 years ago
- Course notes for AA203 ☆153 Updated 5 years ago
- A library to benchmark reinforcement learning algorithms ☆21 Updated 7 years ago
- Efficient Point-Based POMDP Planning by Approximating ☆92 Updated 5 years ago
- Code for ICRA2018 - Intent-aware Multi-agent Reinforcement Learning. ☆22 Updated 7 years ago
- Safe Bayesian Optimization ☆150 Updated 3 years ago
- Implementation of robust adaptive control methods for the linear quadratic regulator ☆38 Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆55 Updated 6 years ago
- Safe exploration in Markov Decision Processes ☆37 Updated 8 years ago
- ☆54 Updated 7 years ago
- Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, … ☆38 Updated 8 years ago
- Path Planning using Generative Adversarial Network (GAN) ☆47 Updated 4 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning ☆33 Updated 8 years ago
- Differential Dynamic Programming controller operating in OpenAI Gym environment. ☆87 Updated 5 years ago
- Jointly learning policies and latent representations for driver behavior. ☆15 Updated 8 years ago
- Python demo for the paper "Pareto Monte Carlo Tree Search for Multi-Objective Informative Planning". ☆34 Updated 3 years ago
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen… ☆32 Updated 4 years ago
- CitySim3D: Simulated car following benchmark ☆27 Updated 2 years ago
- A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env ☆70 Updated 8 years ago
- Safe learning of regions of attraction in uncertain, nonlinear systems with Gaussian processes ☆39 Updated 6 years ago
- ☆80 Updated 6 years ago
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs. ☆18 Updated 4 years ago
- Code for the Black-DROPS algorithm: "Black-Box Data-efficient Policy Search for Robotics", IROS 2017/ICRA 2018 ☆66 Updated 4 years ago
- Enforcing robust control guarantees within neural network policies ☆55 Updated 4 years ago
- Stabilizable Nonlinear Dynamics Learning ☆22 Updated 6 years ago
- ☆43 Updated 4 years ago
- Co-training for Policy Learning ☆13 Updated 6 years ago
- QMDP-Net implementation ☆65 Updated 5 years ago
- Code for the ICRA2018 paper "Learning with training wheels: Speeding up training with a simple controller for Deep Reinforcement Learning… ☆37 Updated 7 years ago
- Code for the paper Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization ☆68 Updated 5 years ago