[ICLR 2023] Choreographer: a world-model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able to efficiently coordinate and adapt the skills to solve downstream tasks.
☆42Jun 18, 2024Updated last year
Alternatives and similar repositories for choreographer
Users that are interested in choreographer are comparing it to the libraries listed below
Sorting:
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆83Jul 27, 2022Updated 3 years ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆41Feb 27, 2024Updated 2 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16May 17, 2023Updated 2 years ago
- ☆13Apr 25, 2024Updated last year
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆86Apr 4, 2025Updated 11 months ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆20Mar 22, 2022Updated 3 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆32Feb 6, 2023Updated 3 years ago
- ☆17Sep 28, 2023Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆86Oct 15, 2023Updated 2 years ago
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆23Nov 16, 2022Updated 3 years ago
- ☆35Jan 4, 2023Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆164Jun 23, 2023Updated 2 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- Simple single file implementations of Reinforcement Learning algorithms in Julia☆23Feb 15, 2025Updated last year
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆22Jul 21, 2025Updated 7 months ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Feb 22, 2023Updated 3 years ago
- ☆10Jun 27, 2024Updated last year
- A scalable benchmark for state representation learning in visual reinforcement learning.☆16Jun 23, 2025Updated 8 months ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Apr 19, 2024Updated last year
- ☆26Apr 26, 2024Updated last year
- ☆360Oct 12, 2022Updated 3 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆112Dec 5, 2023Updated 2 years ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆82May 13, 2024Updated last year
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Oct 25, 2022Updated 3 years ago
- Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023☆56Feb 3, 2023Updated 3 years ago
- Deep Hierarchical Planning from Pixels☆115Dec 21, 2022Updated 3 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆87Jan 24, 2024Updated 2 years ago
- ☆15Jun 1, 2023Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Mar 26, 2024Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning☆129Feb 8, 2022Updated 4 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 2 years ago
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆62Aug 3, 2023Updated 2 years ago
- Advantage weighted Actor Critic for Offline RL☆52Aug 27, 2022Updated 3 years ago
- ☆93Jan 21, 2026Updated last month
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- ☆88Jan 9, 2026Updated last month