automl / mdp-playground
A Python package to design and debug RL agents.
☆28 Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for mdp-playground
- ☆28 Updated 3 years ago
- ☆29 Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆83 Updated 3 years ago
- ☆36 Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization ☆31 Updated 3 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon" ☆42 Updated 4 months ago
- ☆28 Updated last year
- ☆42 Updated last year
- Revisiting Rainbow ☆73 Updated 3 years ago
- ☆34 Updated last year
- HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient ☆17 Updated this week
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 4 years ago
- Vectorization techniques for fast population-based training. ☆54 Updated 2 years ago
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. ☆19 Updated last year
- Change-Based Exploration Transfer ☆36 Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆79 Updated last year
- ☆44 Updated 2 years ago
- on-policy optimization baselines for deep reinforcement learning ☆28 Updated 4 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆55 Updated last year
- ☆30 Updated last year
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆26 Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆52 Updated 7 months ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆32 Updated last year
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e… ☆70 Updated 11 months ago
- Model-based reinforcement learning in TensorFlow ☆54 Updated 3 years ago
- Learning Laplacian Representations in Reinforcement Learning ☆17 Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 Updated last year
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3. ☆12 Updated 2 years ago
- Sandbox environment for generalizable agent research ☆23 Updated 2 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019 ☆78 Updated 5 years ago