Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11 · Jun 12, 2019 · Updated 6 years ago
Alternatives and similar repositories for DD_OPG
Users who are interested in DD_OPG are comparing it to the repositories listed below.
- ☆10 · Aug 17, 2022 · Updated 3 years ago
- Official implementation of MURAL (ICML 2021) ☆17 · Sep 23, 2021 · Updated 4 years ago
- Keras implementation of guide actor-critic for continuous control ☆11 · Mar 12, 2018 · Updated 7 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining" ☆30 · Sep 24, 2019 · Updated 6 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation ☆14 · Aug 4, 2020 · Updated 5 years ago
- Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinf… ☆11 · Jul 14, 2021 · Updated 4 years ago
- Implicit Normalizing Flows + Reinforcement Learning ☆62 · May 31, 2019 · Updated 6 years ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers. ☆19 · May 14, 2019 · Updated 6 years ago
- ICRL 2020 ☆20 · Feb 18, 2020 · Updated 6 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys… ☆21 · Feb 24, 2023 · Updated 3 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning ☆22 · Oct 26, 2018 · Updated 7 years ago
- Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization. ☆17 · Feb 17, 2021 · Updated 5 years ago
- Ranking Policy Gradient ☆23 · Nov 27, 2019 · Updated 6 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control" ☆22 · Dec 17, 2019 · Updated 6 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354 ☆27 · Jul 14, 2021 · Updated 4 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆30 · Mar 14, 2019 · Updated 6 years ago
- Proximal Policy Option-Critic ☆26 · Jan 4, 2019 · Updated 7 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration ☆25 · Sep 9, 2019 · Updated 6 years ago
- ☆30 · Nov 21, 2022 · Updated 3 years ago
- ☆31 · Jul 1, 2019 · Updated 6 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration ☆28 · Feb 10, 2022 · Updated 4 years ago
- P3O paper code ☆30 · Aug 7, 2019 · Updated 6 years ago
- ☆17 · Feb 1, 2026 · Updated last month
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting ☆35 · Feb 9, 2021 · Updated 5 years ago
- ☆32 · Mar 4, 2021 · Updated 5 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning" ☆39 · Jan 22, 2021 · Updated 5 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper. ☆37 · Oct 14, 2020 · Updated 5 years ago
- Benchmarks of artificial neural network library for Spark MLlib ☆11 · Dec 3, 2015 · Updated 10 years ago
- ☆11 · Nov 13, 2025 · Updated 3 months ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022) ☆10 · Dec 1, 2022 · Updated 3 years ago
- Accepted at WWW 25 Industrial Track (oral) ☆18 · Jun 6, 2025 · Updated 8 months ago
- Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom… ☆13 · Aug 15, 2023 · Updated 2 years ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning… ☆11 · Jun 3, 2022 · Updated 3 years ago
- Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms ☆13 · Apr 3, 2024 · Updated last year
- Alpha mining with DEAP-based genetic programming. ☆11 · Jul 7, 2023 · Updated 2 years ago
- factory.ai FACTORY_API_KEY switch and query ☆27 · Dec 6, 2025 · Updated 2 months ago
- Code for optimal execution ☆12 · Oct 29, 2020 · Updated 5 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction ☆163 · Jul 17, 2020 · Updated 5 years ago
- An Online Latent Dirichlet Allocation with Infinite Vocabulary implementation in Python. ☆12 · Oct 4, 2018 · Updated 7 years ago