icaros-usc / dqd-rl
Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"
☆20 · Updated 2 years ago
Alternatives and similar repositories for dqd-rl
Users that are interested in dqd-rl are comparing it to the libraries listed below
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL ☆29 · Updated last year
- Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693) ☆18 · Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL" ☆20 · Updated 3 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆40 · Updated last month
- Generalised UDRL ☆37 · Updated 3 years ago
- ☆32 · Updated last year
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning" ☆20 · Updated 3 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation. ☆33 · Updated 2 years ago
- ☆17 · Updated 4 years ago
- ☆25 · Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 · Updated 3 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration". ☆24 · Updated 2 years ago
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023) ☆13 · Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control" ☆30 · Updated 3 years ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning. ☆24 · Updated 2 years ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment… ☆19 · Updated 3 years ago
- ☆29 · Updated 4 years ago
- ☆31 · Updated 4 years ago
- MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957 ☆63 · Updated 4 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax ☆20 · Updated 2 years ago
- Mirror Descent Policy Optimization ☆38 · Updated 4 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code… ☆27 · Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆28 · Updated 2 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling ☆17 · Updated 3 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆44 · Updated last year
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML… ☆26 · Updated 2 years ago
- ☆31 · Updated last year
- My Body Is A Cage ☆41 · Updated 4 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 · Updated 5 years ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" … ☆16 · Updated last year