[ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and use planning (Dyna-MPC) during fine-tuning.
☆41 · Feb 27, 2024 · Updated 2 years ago
Alternatives and similar repositories for mastering-urlb
Users that are interested in mastering-urlb are comparing it to the libraries listed below.
- [ICLR 2023] Choreographer: a world-model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able t… ☆43 · Jun 18, 2024 · Updated last year
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning ☆16 · Apr 30, 2023 · Updated 3 years ago
- BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery. ☆23 · May 11, 2023 · Updated 3 years ago
- MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023] ☆16 · May 17, 2023 · Updated 3 years ago
- ☆13 · Apr 25, 2024 · Updated 2 years ago
- ☆364 · Oct 12, 2022 · Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment ☆30 · Dec 14, 2021 · Updated 4 years ago
- Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024) ☆19 · Oct 13, 2024 · Updated last year
- MoDem Accelerating Visual Model-Based Reinforcement Learning with Demonstrations ☆87 · Dec 12, 2022 · Updated 3 years ago
- This is the repo of NeurIPS 2022 paper: "Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning" ☆16 · Sep 21, 2023 · Updated 2 years ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks. ☆83 · May 13, 2024 · Updated 2 years ago
- Evaluating video predictions from the standpoint of a robot making action decisions ☆13 · May 28, 2020 · Updated 5 years ago
- KitchenShift: Evaluating Zero-Shot Generalization of Imitation-Based Policy Learning Under Domain Shifts ☆19 · Jun 21, 2022 · Updated 3 years ago
- phone teleoperation for robots ☆112 · Apr 13, 2026 · Updated last month
- Standalone library of frequently-used wrappers for dm_env environments. ☆19 · Jul 9, 2024 · Updated last year
- ☆14 · Dec 4, 2023 · Updated 2 years ago
- Code release for H-GAP Humanoid Control with a Generalist Planner ☆24 · Nov 25, 2024 · Updated last year
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023) ☆19 · May 10, 2024 · Updated 2 years ago
- ☆13 · Feb 24, 2023 · Updated 3 years ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th… ☆85 · Apr 4, 2025 · Updated last year
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm) ☆168 · Jan 19, 2025 · Updated last year
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control" ☆831 · May 21, 2025 · Updated last year
- The Power of the Senses: Generalizable Manipulation from Vision and Touch through Masked Multimodal Learning ☆43 · Aug 13, 2024 · Updated last year
- Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning". ☆13 · Jan 25, 2023 · Updated 3 years ago
- GreenAug: Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation ☆13 · Sep 10, 2024 · Updated last year
- DMControl Generalization Benchmark ☆189 · Jan 3, 2024 · Updated 2 years ago
- MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula… ☆25 · Apr 1, 2024 · Updated 2 years ago
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023) ☆31 · Mar 3, 2025 · Updated last year
- Code and data for Learning Rewards from Linguistic Feedback, AAAI '21 ☆11 · Dec 16, 2020 · Updated 5 years ago
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines ☆52 · Jun 3, 2022 · Updated 3 years ago
- ☆24 · Jan 26, 2024 · Updated 2 years ago
- ☆61 · Apr 16, 2023 · Updated 3 years ago
- Transformer-based World Models ☆89 · Apr 4, 2023 · Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆86 · Jul 27, 2022 · Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO ☆16 · May 19, 2023 · Updated 3 years ago
- Modular Single-file Reinforcement Learning Algorithms Library ☆38 · May 16, 2023 · Updated 3 years ago
- ☆10 · Oct 15, 2020 · Updated 5 years ago
- MetaArcade is a configurable environment suite for meta-learning ☆16 · Oct 19, 2022 · Updated 3 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆115 · Apr 16, 2026 · Updated last month