rail-berkeley / SUPE
This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."
☆28 Updated 7 months ago
Alternatives and similar repositories for SUPE
Users who are interested in SUPE are comparing it to the repositories listed below.
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations ☆32 Updated 7 months ago
- ☆32 Updated last week
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024. ☆28 Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆57 Updated last year
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024 ☆62 Updated last year
- ☆43 Updated 5 months ago
- ☆44 Updated 10 months ago
- Diffusing States and Matching Scores: A New Framework for Imitation Learning ☆19 Updated 6 months ago
- ☆24 Updated 11 months ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024) ☆16 Updated 4 months ago
- Official implementation of DEMO3 ☆47 Updated 2 weeks ago
- High quality implementations of imitation and inverse reinforcement learning algorithms ☆18 Updated 2 months ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" … ☆16 Updated 11 months ago
- ☆22 Updated last year
- PWM: Policy Learning with Large World Models ☆49 Updated 3 months ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th… ☆77 Updated 2 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024 ☆67 Updated last year
- AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL. ☆33 Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster. ☆20 Updated 11 months ago
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation" ☆29 Updated 6 months ago
- Action Value Gradient Algorithm ☆20 Updated 2 weeks ago
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025 ☆11 Updated last week
- ☆22 Updated 2 weeks ago
- ☆41 Updated 10 months ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions" ☆89 Updated last year
- ☆56 Updated 11 months ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment… ☆19 Updated 3 years ago
- ☆15 Updated 2 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 Updated last year