These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implementation.
☆17Sep 20, 2017Updated 8 years ago
Alternatives and similar repositories for ReproducibilityInContinuousPolicyGradientMethods
Users that are interested in ReproducibilityInContinuousPolicyGradientMethods are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation is mostly based on Sergey Levine work (http://www.eecs.berkeley.edu/~svlevine/).☆45Dec 11, 2014Updated 11 years ago
- ☆29May 17, 2017Updated 9 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago
- This is my implementation of the Optimality Tightening☆37Apr 26, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Accompanying code for "Deep Reinforcement Learning that Matters"☆154Sep 22, 2017Updated 8 years ago
- Tutorial on continuous control at Reinforcement Learning Summer School 2017.☆34Jul 3, 2017Updated 8 years ago
- Deep Semi-Supervised Learning with Holistic methods for audio classification.☆11Dec 14, 2024Updated last year
- Using Pilco algorithm to find a controller for few robotic problems☆43Jul 31, 2015Updated 10 years ago
- Expectation Particle Belief Propagation code☆13Oct 8, 2018Updated 7 years ago
- Real-time image and video foveation transform using PyCUDA☆11Jan 6, 2021Updated 5 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Rectifying Self Organizing Map☆29Oct 7, 2024Updated last year
- Continual Learning Toolkit for Reinforcement Learning☆21Jan 28, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆23Oct 7, 2018Updated 7 years ago
- WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer☆11Dec 31, 2016Updated 9 years ago
- Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking).☆13Mar 24, 2023Updated 3 years ago
- Awesome openai gym environments☆12Aug 6, 2019Updated 6 years ago
- ☆13Jun 23, 2017Updated 8 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆438Nov 28, 2023Updated 2 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆52Jul 25, 2016Updated 9 years ago
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)☆80Mar 13, 2017Updated 9 years ago
- Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games"☆114Feb 8, 2016Updated 10 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- SMASH: Physics-guided Reconstruction of Collisions from Videos, SIGGRAPH Asia 2016☆11Jan 25, 2018Updated 8 years ago
- ☆28Apr 28, 2019Updated 7 years ago
- ☆20Jan 31, 2018Updated 8 years ago
- Generalised UDRL☆37May 12, 2022Updated 4 years ago
- ☆28Oct 9, 2017Updated 8 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- ☆161Jul 21, 2017Updated 8 years ago
- ☆20Mar 17, 2017Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Sep 9, 2019Updated 6 years ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs☆22Apr 24, 2025Updated last year
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10May 8, 2018Updated 8 years ago
- Some starter code for training/testing some basic CNN models given our data.☆10Feb 15, 2017Updated 9 years ago
- A minimal Unreal Engine project for developing and testing UnrealCV☆17Nov 8, 2018Updated 7 years ago
- From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Neural Networks (DDCNN)☆42Nov 3, 2016Updated 9 years ago
- Code for reproducing the results in "Mining Semantic Affordances of Visual Object Categories"☆12Jun 10, 2024Updated 2 years ago