Implementation of clipped action policy gradient (CAPG) with PPO and TRPO
☆31Jun 24, 2018Updated 7 years ago
Alternatives and similar repositories for capg
Users that are interested in capg are comparing it to the libraries listed below
Sorting:
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…☆17Sep 20, 2017Updated 8 years ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Code-base for the paper Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective.☆11Jun 26, 2021Updated 4 years ago
- 実装するリスト☆10Dec 21, 2017Updated 8 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 7 years ago
- WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer☆11Dec 31, 2016Updated 9 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Sep 9, 2019Updated 6 years ago
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago
- ☆20May 24, 2017Updated 8 years ago
- Metal Warfare game for ML-Agents challenge☆18Feb 24, 2018Updated 8 years ago
- Library to compare and evaluate reward functions☆68Oct 23, 2023Updated 2 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 4 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Nov 24, 2022Updated 3 years ago
- This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…☆19Oct 5, 2021Updated 4 years ago
- ☆54Jan 13, 2023Updated 3 years ago
- ☆22Nov 8, 2021Updated 4 years ago
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- MIT racecar_simulator ported to python and speeded up using GPU ray marching☆20May 25, 2020Updated 5 years ago
- Continual Learning Toolkit for Reinforcement Learning☆21Jan 28, 2018Updated 8 years ago
- Upper Confidence Tree Planner for ATARI games☆19Mar 9, 2016Updated 9 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Nov 15, 2018Updated 7 years ago
- Deep reinforcement learning in ViZDoom (using Tensorflow)☆19Jan 25, 2018Updated 8 years ago
- Helpful files for Visual Doom AI Competition 2017☆45Jun 21, 2018Updated 7 years ago
- ☆18Apr 17, 2019Updated 6 years ago
- Run a static part of the computational graph written in Chainer with Tensorflow☆20Jan 10, 2017Updated 9 years ago
- ☆23Oct 7, 2018Updated 7 years ago
- Implementation is mostly based on Sergey Levine work (http://www.eecs.berkeley.edu/~svlevine/).☆44Dec 11, 2014Updated 11 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆88Jan 22, 2019Updated 7 years ago
- An event-based on-line adaptable fast nonlinear model predictive control framework☆25Oct 29, 2018Updated 7 years ago
- Malmö challenge☆18May 22, 2017Updated 8 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Sep 22, 2017Updated 8 years ago