rail-berkeley / design-bench
☆44Updated 2 years ago
Related projects: ⓘ
- ☆20Updated 2 years ago
- Baselines for Model-Based Optimization☆49Updated 2 years ago
- ☆27Updated 3 years ago
- ☆34Updated last year
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Updated last year
- Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…☆27Updated 2 years ago
- ☆27Updated last year
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆19Updated last year
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆39Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- ☆27Updated 3 years ago
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆10Updated 3 years ago
- Invariant Causal Prediction for Block MDPs☆43Updated 4 years ago
- Representation Learning in RL☆16Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆64Updated 2 years ago
- ☆85Updated last month
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆25Updated 10 months ago
- on-policy optimization baselines for deep reinforcement learning☆28Updated 4 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆16Updated 3 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆24Updated last year
- PyTorch implementation of Probabilistic Network Ensembles on toy problems☆23Updated last year
- ☆11Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆30Updated 4 years ago
- Conservative Q learning in Jax☆49Updated last year
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Updated 2 months ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆35Updated 2 weeks ago
- Docker containers of baseline agents for the Crafter environment☆27Updated 2 years ago
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆11Updated last year