Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is the Evolution Strategies implementation, but of course the method can be used for gradient based RL algorithms (e.g. TD3).
☆46Oct 29, 2020Updated 5 years ago
Alternatives and similar repositories for DvD_ES
Users that are interested in DvD_ES are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- ☆17Dec 12, 2020Updated 5 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Jan 9, 2020Updated 6 years ago
- Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …☆16Oct 14, 2020Updated 5 years ago
- Paper: Challenges in High-dimensional Reinforcement Learning with Evolution Strategies☆29May 30, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A python implementation of differentiable quality diversity.☆51Oct 29, 2021Updated 4 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Vectorization techniques for fast population-based training.☆57Apr 26, 2026Updated last month
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Nov 6, 2020Updated 5 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Jun 20, 2021Updated 4 years ago
- Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d…☆60Oct 18, 2021Updated 4 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Applying PBT optimization technique to different domains☆10Oct 16, 2019Updated 6 years ago
- Code for the paper "Learning Step-Size Adaptation in CMA-ES"☆12Mar 24, 2023Updated 3 years ago
- ☆71Jan 3, 2023Updated 3 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆107Nov 17, 2020Updated 5 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Sep 1, 2022Updated 3 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆15Mar 9, 2021Updated 5 years ago
- [ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control☆132Oct 9, 2020Updated 5 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- ☆36Aug 10, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆19Dec 26, 2025Updated 5 months ago
- original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning☆28Jun 8, 2020Updated 5 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Accelerated Quality-Diversity☆353Oct 30, 2025Updated 6 months ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆56Aug 30, 2024Updated last year
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning☆11Jun 16, 2022Updated 3 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆77Nov 21, 2023Updated 2 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆21Aug 26, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- MVE: model-based value estimation☆11Jul 30, 2018Updated 7 years ago
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Apr 13, 2021Updated 5 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆22Oct 3, 2022Updated 3 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆60Apr 6, 2022Updated 4 years ago
- Ordered Preference Elicitation Strategies for Multi-Objective Decision Making using Gaussian Processes☆23Jul 25, 2018Updated 7 years ago
- Website for Quality-Diversity optimisation algorithms☆49Dec 17, 2025Updated 5 months ago