Implementation is mostly based on Sergey Levine work (http://www.eecs.berkeley.edu/~svlevine/).
☆45Dec 11, 2014Updated 11 years ago
Alternatives and similar repositories for guided-policy-search
Users that are interested in guided-policy-search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Guided policy search in Python and ROS Indigo.☆26Feb 12, 2026Updated 3 months ago
- Guided Policy Search☆600Feb 9, 2021Updated 5 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…☆17Sep 20, 2017Updated 8 years ago
- This is my implementation of the Optimality Tightening☆37Apr 26, 2017Updated 9 years ago
- Code for Max-Margin Deep Generative Models☆12Jan 1, 2015Updated 11 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆36Aug 2, 2016Updated 9 years ago
- A Lua wrapper for the Arcade Learning Environment☆17May 9, 2014Updated 12 years ago
- Implementation of Variational Intrinsic Control in tensorflow☆11Apr 5, 2017Updated 9 years ago
- Implementation of TRPO and related algorithms☆651May 20, 2018Updated 8 years ago
- Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.☆10Nov 8, 2018Updated 7 years ago
- 強化学習論文のサーベイリポジトリ☆13Jun 23, 2017Updated 8 years ago
- Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning☆16Nov 7, 2018Updated 7 years ago
- Monitor parameter and gradient statistics during neural network training with Chainer☆13Jan 24, 2017Updated 9 years ago
- Simple tools for statistical analyses in RL experiments☆67Jun 21, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Universal library for deep reinforcement learning.☆38Apr 15, 2016Updated 10 years ago
- Bayesian Poisson Tucker decomposition☆17Mar 17, 2017Updated 9 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Asynchronous Advantage Actor Critic☆20Aug 15, 2016Updated 9 years ago
- Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoo…☆12Jun 20, 2018Updated 7 years ago
- Variational Recurrent Auto Encoder☆15Jul 10, 2016Updated 9 years ago
- From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Neural Networks (DDCNN)☆42Nov 3, 2016Updated 9 years ago
- RLPy Reinforcement Learning Framework☆254Sep 29, 2019Updated 6 years ago
- 「速習 強化学習 -基礎理論とアルゴリズム-」サポートページ☆15Nov 25, 2017Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Gated Recurrent Unit with Low-rank matrix factorization☆35Mar 11, 2016Updated 10 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning☆93Apr 17, 2018Updated 8 years ago
- Deep Attention Recurrent Q-Network☆115Nov 7, 2015Updated 10 years ago
- Implementation of an iterative linear quadratic regular (iLQR) on inverted pendulum, box quadratic programming (box-QP) is used to deal w…☆10Jul 24, 2018Updated 7 years ago
- Chainer implementation of Self-Normalizing Networks (SNN)☆25Jun 11, 2017Updated 8 years ago
- A toolkit for developing and comparing reinforcement learning algorithms using ROS, Player/Stage and Gazebo.☆24Feb 21, 2018Updated 8 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆214Feb 16, 2018Updated 8 years ago
- ☆28Apr 15, 2017Updated 9 years ago
- ☆11Aug 27, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Source code of DeepTracking research project☆130Aug 22, 2016Updated 9 years ago
- PyMC Example Notebooks☆74Feb 16, 2014Updated 12 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Using Pilco algorithm to find a controller for few robotic problems☆43Jul 31, 2015Updated 10 years ago
- An implementation of the RL-NTM from http://arxiv.org/abs/1505.00521☆160Jan 7, 2016Updated 10 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆31Jun 24, 2018Updated 7 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆438Nov 28, 2023Updated 2 years ago