pfnet-research / chainer-differentiable-mpc
Differentiable MPC in Chainer, developed as part of PFN summer internship 2019.
☆12Updated 2 years ago
Related projects: ⓘ
- Research repo of RL☆22Updated last year
- ☆53Updated last year
- Study Group of Model-based RL, 高橋研究室のモデルベース強化学習勉強会のスライドのまとめです☆25Updated 5 years ago
- Code associated with paper "High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization"☆15Updated 3 years ago
- A codebase for experimenting with various approaches to action priors.☆19Updated 6 years ago
- Simple and extensible hypergradient for PyTorch☆16Updated last year
- Theano☆11Updated 7 years ago
- Pixyz Tutorial in RL Architecture Study Group☆11Updated 5 years ago
- ☆28Updated 5 years ago
- [WIP] Python implementation of evolution strategy based on Information Geometry. This library includes CMA-ES, NES, CompactGA and PBIL.☆15Updated 5 years ago
- ☆13Updated 7 years ago
- ☆13Updated 4 years ago
- A framework for Bayesian optimization of composite functions.☆14Updated last year
- ☆11Updated this week
- Neural Fixed-Point Acceleration for Convex Optimization☆29Updated last year
- project for my essay on how to use neural networks to linearise nonlinear dynamical systems☆9Updated 4 years ago
- ☆12Updated 3 years ago
- pycombina - Solving binary approximation problems in Python☆20Updated 9 months ago
- ☆46Updated last year
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54Updated 5 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆29Updated 6 years ago
- Entropy Search for Information-Efficient Global Optimization - JMLR v13☆26Updated 7 years ago
- ☆68Updated 4 years ago
- Python implementation of the PR-SSM.☆51Updated 6 years ago
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…☆32Updated 3 years ago
- ☆23Updated 4 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆47Updated 2 years ago
- Learning dynamical systems from data: Koopman☆15Updated 4 years ago
- TensorFlow impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆65Updated 8 years ago