mengdi-li / internally-rewarded-rl
[ICML 2023] Code for paper "Internally Rewarded Reinforcement Learning"
☆10Updated last year
Alternatives and similar repositories for internally-rewarded-rl:
Users that are interested in internally-rewarded-rl are comparing it to the libraries listed below
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)☆35Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆100Updated 11 months ago
- ☆25Updated last year
- Official release of CompoSuite, a compositional RL benchmark☆47Updated last year
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆80Updated 4 months ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆14Updated 10 months ago
- ☆39Updated last year
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.☆18Updated 4 years ago
- Skeleton for scalable and flexible Jax RL implementations☆80Updated last year
- ☆48Updated last year
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022)☆37Updated last year
- ☆43Updated 5 months ago
- ☆25Updated 2 years ago
- [ICLR 2023] Choreographer: a model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able to effi…☆40Updated 10 months ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 2 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆27Updated last year
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)☆21Updated 3 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 8 months ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code☆16Updated 8 months ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆16Updated 2 years ago
- ☆53Updated 3 years ago
- ☆15Updated last year
- Official implementation of NeurIPS'23 paper, Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets☆24Updated last year
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆73Updated 11 months ago
- Official code repository for Prompt-DT.☆109Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 9 months ago
- From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data☆53Updated 2 years ago
- ☆119Updated 4 years ago
- BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery.☆19Updated last year