jparkerholder / DvD_ESView external linksLinks
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is the Evolution Strategies implementation, but of course the method can be used for gradient based RL algorithms (e.g. TD3).
☆45Oct 29, 2020Updated 5 years ago
Alternatives and similar repositories for DvD_ES
Users that are interested in DvD_ES are comparing it to the libraries listed below
Sorting:
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Jan 9, 2020Updated 6 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- A python implementation of differentiable quality diversity.☆51Oct 29, 2021Updated 4 years ago
- Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …☆16Oct 14, 2020Updated 5 years ago
- Paper: Challenges in High-dimensional Reinforcement Learning with Evolution Strategies☆29May 30, 2022Updated 3 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Nov 6, 2020Updated 5 years ago
- Vectorization techniques for fast population-based training.☆57Aug 12, 2022Updated 3 years ago
- Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d…☆58Oct 18, 2021Updated 4 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 4 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆107Nov 17, 2020Updated 5 years ago
- Code for the paper "Learning Step-Size Adaptation in CMA-ES"☆12Mar 24, 2023Updated 2 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Jun 20, 2021Updated 4 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- ☆71Jan 3, 2023Updated 3 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆22Oct 3, 2022Updated 3 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 4 years ago
- Applying PBT optimization technique to different domains☆10Oct 16, 2019Updated 6 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Accelerated Quality-Diversity☆339Oct 30, 2025Updated 3 months ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- ☆36Aug 10, 2018Updated 7 years ago
- Repository for the QDgym code. A framework for Quality Diversity optimization benchmark tasks based OpenAI Gym.☆25May 18, 2021Updated 4 years ago
- [ICLR'20] Learning to Learn by Zeroth-Order Oracle☆14Feb 7, 2020Updated 6 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Sep 1, 2022Updated 3 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆76Nov 21, 2023Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆19Dec 26, 2025Updated last month
- [ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control☆125Oct 9, 2020Updated 5 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆59Apr 6, 2022Updated 3 years ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆138Aug 20, 2024Updated last year
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆19Mar 10, 2021Updated 4 years ago
- Tutorial on Multi-Objective Recommender Systems @ KDD 2021☆19Dec 4, 2022Updated 3 years ago
- Website for Quality-Diversity optimisation algorithms☆49Dec 17, 2025Updated last month
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning☆28Jun 8, 2020Updated 5 years ago
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas☆54Dec 8, 2022Updated 3 years ago