Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
☆22Dec 17, 2019Updated 6 years ago
Alternatives and similar repositories for GAC
Users that are interested in GAC are comparing it to the libraries listed below
Sorting:
- Implementing DQNClipped and DQNReg Algorithms☆10Mar 2, 2021Updated 4 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆69Aug 11, 2023Updated 2 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Nov 22, 2022Updated 3 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.☆10Nov 8, 2018Updated 7 years ago
- ☆17Dec 19, 2024Updated last year
- Official implementation of Neural Episodic Control with State Abstraction☆13Aug 3, 2023Updated 2 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆28Feb 21, 2022Updated 4 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- Co-training for Policy Learning☆13Aug 8, 2019Updated 6 years ago
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆140Nov 16, 2021Updated 4 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …☆16Oct 14, 2020Updated 5 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆19Mar 10, 2021Updated 4 years ago
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…☆18Apr 13, 2021Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago
- Pytorch implementation of Planar Flow☆17Dec 2, 2019Updated 6 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆19Jan 11, 2023Updated 3 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 3 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆48Apr 14, 2019Updated 6 years ago
- Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.☆17Jun 23, 2021Updated 4 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Hierarchical Self-Play☆21Dec 5, 2018Updated 7 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆46Oct 29, 2020Updated 5 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- ☆23Aug 9, 2022Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Jul 26, 2019Updated 6 years ago
- Code for the paper Adaptive Auxiliary Task Weighting for Reinforcement Learning☆26Jun 29, 2020Updated 5 years ago
- Map-Elites based on Evolution Strategies☆33Feb 11, 2022Updated 4 years ago