[ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and use planning (Dyna-MPC) during fine-tuning.
☆41Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for mastering-urlb
Users that are interested in mastering-urlb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2023] Choreographer: a world-model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able t…☆42Jun 18, 2024Updated last year
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery.☆23May 11, 2023Updated 2 years ago
- MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023]☆16May 17, 2023Updated 2 years ago
- ☆13Apr 25, 2024Updated last year
- ☆361Oct 12, 2022Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)☆19Oct 13, 2024Updated last year
- MoDem Accelerating Visual Model-Based Reinforcement Learning with Demonstrations☆87Dec 12, 2022Updated 3 years ago
- This is the repo of NeurIPS 2022 paper: "Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning"☆15Sep 21, 2023Updated 2 years ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆83May 13, 2024Updated last year
- Evaluating video predictions from the standpoint of a robot making action decisions☆13May 28, 2020Updated 5 years ago
- KitchenShift: Evaluating Zero-Shot Generalization of Imitation-Based Policy Learning Under Domain Shifts☆19Jun 21, 2022Updated 3 years ago
- phone teleoperation for robots☆105Feb 10, 2026Updated last month
- Standalone library of frequently-used wrappers for dm_env environments.☆19Jul 9, 2024Updated last year
- ☆14Dec 4, 2023Updated 2 years ago
- Code release for H-GAP Humanoid Control with a Generalist Planner☆24Nov 25, 2024Updated last year
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆19May 10, 2024Updated last year
- ☆13Feb 24, 2023Updated 3 years ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆86Apr 4, 2025Updated 11 months ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆168Jan 19, 2025Updated last year
- Code and data for Learning Rewards from Linguistic Feedback, AAAI '21☆10Dec 16, 2020Updated 5 years ago
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆772May 21, 2025Updated 10 months ago
- MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula…☆23Apr 1, 2024Updated last year
- The Power of the Senses: Generalizable Manipulation from Vision and Touch through Masked Multimodal Learning☆40Aug 13, 2024Updated last year
- Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning".☆13Jan 25, 2023Updated 3 years ago
- GreenAug: Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation☆13Sep 10, 2024Updated last year
- DMControl Generalization Benchmark☆189Jan 3, 2024Updated 2 years ago
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)☆30Mar 3, 2025Updated last year
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆51Jun 3, 2022Updated 3 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- ☆60Apr 16, 2023Updated 2 years ago
- Transformer-based World Models☆89Apr 4, 2023Updated 2 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆85Jul 27, 2022Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38May 16, 2023Updated 2 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- MetaArcade is a configurable environment suite for meta-learning☆16Oct 19, 2022Updated 3 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆112May 27, 2024Updated last year