Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
☆22Dec 17, 2019Updated 6 years ago
Alternatives and similar repositories for GAC
Users that are interested in GAC are comparing it to the libraries listed below
Sorting:
- Implementing DQNClipped and DQNReg Algorithms☆10Mar 2, 2021Updated 5 years ago
- Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.☆10Nov 8, 2018Updated 7 years ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…☆18Apr 13, 2021Updated 4 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆69Aug 11, 2023Updated 2 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆141Nov 16, 2021Updated 4 years ago
- Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …☆16Oct 14, 2020Updated 5 years ago
- Official implementation of Neural Episodic Control with State Abstraction☆13Aug 3, 2023Updated 2 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Nov 22, 2022Updated 3 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Nov 28, 2019Updated 6 years ago
- Build and Train Neural Networks in Emacs Lisp☆14May 22, 2024Updated last year
- ☆33Nov 21, 2022Updated 3 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆28Feb 21, 2022Updated 4 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆48Apr 14, 2019Updated 6 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- ☆10Dec 14, 2022Updated 3 years ago
- ☆11Oct 14, 2019Updated 6 years ago
- ☆15May 4, 2025Updated 10 months ago
- A little logger for machine learning research☆35Aug 30, 2023Updated 2 years ago
- ☆17Dec 19, 2024Updated last year
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Jul 26, 2019Updated 6 years ago
- Bombing AI agents☆12Jun 21, 2018Updated 7 years ago
- Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.☆17Jun 23, 2021Updated 4 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Jan 22, 2021Updated 5 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- ☆24Aug 9, 2022Updated 3 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆46Oct 29, 2020Updated 5 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- Autoregressive policies for continuous control reinforcement learning☆32May 15, 2019Updated 6 years ago
- Hierarchical Self-Play☆21Dec 5, 2018Updated 7 years ago
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆99Jun 22, 2020Updated 5 years ago
- GoLU, a novel, self-gated and element-wise activation function that performs well over a diverse set of tasks☆24Oct 4, 2025Updated 5 months ago