mazpie/mastering-urlb

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mazpie/mastering-urlb)

mazpie / mastering-urlb

[ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and use planning (Dyna-MPC) during fine-tuning.

☆41

Alternatives and similar repositories for mastering-urlb

Users that are interested in mastering-urlb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mazpie / choreographer
View on GitHub
[ICLR 2023] Choreographer: a world-model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able t…
☆42Jun 18, 2024Updated 2 years ago
Rooshy-yang / BeCL
View on GitHub
BeCL: Behavior Contrastive Learning for Unsupervised Skill Discovery.
☆23May 11, 2023Updated 3 years ago
ToruOwO / mimex
View on GitHub
MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023]
☆16May 17, 2023Updated 3 years ago
uoe-agents / CMID
View on GitHub
☆13Apr 25, 2024Updated 2 years ago
rll-research / url_benchmark
View on GitHub
☆367Oct 12, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
danijar / crafter-baselines
View on GitHub
Docker containers of baseline agents for the Crafter environment
☆30Dec 14, 2021Updated 4 years ago
facebookresearch / modem
View on GitHub
MoDem Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
☆87Dec 12, 2022Updated 3 years ago
mserranunes / action-inference-for-video-prediction-benchmarking
View on GitHub
Evaluating video predictions from the standpoint of a robot making action decisions
☆13May 28, 2020Updated 6 years ago
etaoxing / kitchen-shift
View on GitHub
KitchenShift: Evaluating Zero-Shot Generalization of Imitation-Based Policy Learning Under Domain Shifts
☆20Jun 21, 2022Updated 4 years ago
penn-pal-lab / peg
View on GitHub
Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.
☆83May 13, 2024Updated 2 years ago
kevinzakka / dm_env_wrappers
View on GitHub
Standalone library of frequently-used wrappers for dm_env environments.
☆19Jul 9, 2024Updated 2 years ago
annahdo / counterfactuals
View on GitHub
☆14Dec 4, 2023Updated 2 years ago
51616 / marl-lipo
View on GitHub
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
☆19May 10, 2024Updated 2 years ago
philtabor / Advanced-Replay-Strategies
View on GitHub
☆13Feb 24, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
gemcollector / PIE-G
View on GitHub
This is the repo of NeurIPS 2022 paper: "Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning"
☆16Sep 21, 2023Updated 2 years ago
facebookresearch / modemv2
View on GitHub
MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula…
☆25Apr 1, 2024Updated 2 years ago
mazpie / genrl
View on GitHub
[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…
☆87Apr 4, 2025Updated last year
yusukeurakami / dreamer-pytorch
View on GitHub
pytorch-implementation of Dreamer (Model-based Image RL Algorithm)
☆169Jan 19, 2025Updated last year
thu-ml / CEURL
View on GitHub
Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)
☆19Oct 13, 2024Updated last year
eugeneteoh / greenaug
View on GitHub
GreenAug: Green Screen Augmentation Enables Scene Generalisation in Robotic Manipulation
☆13Sep 10, 2024Updated last year
nicklashansen / dmcontrol-generalization-benchmark
View on GitHub
DMControl Generalization Benchmark
☆189Jan 3, 2024Updated 2 years ago
uoe-agents / TED
View on GitHub
Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning".
☆13Jan 25, 2023Updated 3 years ago
s-tian / vp2
View on GitHub
VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)
☆32Mar 3, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
tsumers / rewards
View on GitHub
Code and data for Learning Rewards from Linguistic Feedback, AAAI '21
☆11Dec 16, 2020Updated 5 years ago
lifelong-learning-systems / meta-arcade
View on GitHub
MetaArcade is a configurable environment suite for meta-learning
☆16Oct 19, 2022Updated 3 years ago
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
zhaoyi11 / tcrl
View on GitHub
☆26Jan 26, 2024Updated 2 years ago
younggyoseo / MV-MWM
View on GitHub
☆61Apr 16, 2023Updated 3 years ago
jrobine / twm
View on GitHub
Transformer-based World Models
☆90Apr 4, 2023Updated 3 years ago
nicklashansen / tdmpc2
View on GitHub
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
☆898Jul 13, 2026Updated last week
rll-research / cic
View on GitHub
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
☆88Jul 27, 2022Updated 3 years ago
RyanNavillus / PPO-v3
View on GitHub
Adding Dreamer-v3's implementation tricks to CleanRL's PPO
☆16May 19, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
carlosferrazza / M3L
View on GitHub
The Power of the Senses: Generalizable Manipulation from Vision and Touch through Masked Multimodal Learning
☆43Aug 13, 2024Updated last year
seohongpark / ogbench
View on GitHub
A benchmark for offline goal-conditioned RL and offline RL
☆441Jan 14, 2026Updated 6 months ago
sdpkjc / abcdrl
View on GitHub
Modular Single-file Reinfocement Learning Algorithms Library
☆38May 16, 2023Updated 3 years ago
omarrayyann / TeleDex
View on GitHub
phone teleoperation for robots
☆118Jul 9, 2026Updated 2 weeks ago
maxencefaldor / learned-qd
View on GitHub
Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization
☆25Dec 1, 2025Updated 7 months ago
conglu1997 / v-d4rl
View on GitHub
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆115Apr 16, 2026Updated 3 months ago
gemcollector / learning-from-scratch
View on GitHub
The repository of ICML2023 paper: On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline
☆23May 28, 2023Updated 3 years ago