imgeorgiev/PWM

PWM: Policy Learning with Large World Models

☆64

Alternatives and similar repositories for PWM

Users who are interested in PWM are comparing it to the repositories listed below.

facebookresearch / modemv2
MoDem-V2 combines the sample efficiency of the original MoDem with conservative exploration in order to quickly and safely learn manipula…
☆22 · Apr 1, 2024 · Updated last year
heatz123 / tldr
Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
☆36 · Jan 24, 2026 · Updated last month
nicklashansen / tdmpc2
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
☆759 · May 21, 2025 · Updated 9 months ago
rail-berkeley / grif_release
Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control"
☆17 · Apr 9, 2024 · Updated last year
bit1029public / HRSSM
PyTorch implementation of Learning Latent Dynamic Robust Representations for World Models
☆24 · May 11, 2024 · Updated last year
lmur98 / epic_kitchens_affordances
☆11 · Jul 19, 2023 · Updated 2 years ago
julian-8897 / hyperbolic-latent-vae
Variational Autoencoder with non-Euclidean (hyperbolic) latent space
☆12 · Nov 25, 2022 · Updated 3 years ago
younggyoseo / MWM
Masked World Models for Visual Control
☆135 · Jun 11, 2023 · Updated 2 years ago
XuGW-Kevin / DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …
☆78 · Feb 19, 2026 · Updated last week
zoharri / mamba
Meta-RL Model-Based Algorithm
☆43 · Apr 30, 2025 · Updated 10 months ago
mazpie / genrl
[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…
☆86 · Apr 4, 2025 · Updated 10 months ago
CMU-AIRe / floq
Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL
☆31 · Feb 7, 2026 · Updated 3 weeks ago
lmzintgraf / hyperx
☆17 · Aug 2, 2022 · Updated 3 years ago
yunhaif / fowm
Finetuning Offline World Models in the Real World
☆65 · Oct 25, 2023 · Updated 2 years ago
adrialopezescoriza / demo3
Official implementation of DEMO3
☆65 · Jul 29, 2025 · Updated 7 months ago
thuml / ContextWM
Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…
☆70 · Sep 29, 2024 · Updated last year
vla-safe / SAFE
This is the official repository for "SAFE: Multitask Failure Detection for Vision-Language-Action Models" (NeurIPS 2025)
☆56 · Jan 18, 2026 · Updated last month
yufeiwang63 / ROLL
Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020
☆16 · Jun 22, 2022 · Updated 3 years ago
machado-research / AgarCL
Agar.io for Continual Reinforcement Learning
☆23 · Jul 24, 2025 · Updated 7 months ago
nicklashansen / puppeteer
Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"
☆200 · Sep 18, 2025 · Updated 5 months ago
srsohn / msgi
ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies
☆18 · Jul 16, 2020 · Updated 5 years ago
penn-pal-lab / peg
Code for "Planning Goals for Exploration", ICLR 2023 Spotlight. An unsupervised RL agent for hard exploration tasks.
☆82 · May 13, 2024 · Updated last year
clvrai / skill-chaining
Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization (CoRL 2021)
☆36 · May 3, 2022 · Updated 3 years ago
irom-princeton / byovla
Repo for Bring Your Own Vision-Language-Action (VLA) model, arXiv 2024
☆36 · Jan 22, 2025 · Updated last year
twni2016 / self-predictive-rl
Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024
☆24 · Apr 7, 2024 · Updated last year
facebookresearch / hgap
Code release for H-GAP Humanoid Control with a Generalist Planner
☆24 · Nov 25, 2024 · Updated last year
philipjball / ReadyPolicyOne
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arXiv: 2002.02693)
☆18 · Jul 6, 2023 · Updated 2 years ago
Ricky-Zhu / IRDEC
[IROS 2023] Learning to Solve Tasks with Exploring Prior Behaviours
☆12 · Mar 3, 2024 · Updated last year
ankile / robust-rearrangement
From Imitation to Refinement -- Residual RL for Precise Assembly
☆213 · Dec 2, 2025 · Updated 3 months ago
EmptyJackson / policy-guided-diffusion
Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"
☆152 · Jul 19, 2024 · Updated last year
ldcq / ldcq
☆36 · May 24, 2023 · Updated 2 years ago
intuitive-robots / beso
[RSS 2023] Official code for "Goal Conditioned Imitation Learning using Score-based Diffusion Policies"
☆89 · Dec 1, 2023 · Updated 2 years ago
juliusfrost / dreamer-pytorch
Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.
☆321 · Jan 11, 2024 · Updated 2 years ago
ComputationalRobotics / TRAC
This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …
☆32 · May 2, 2025 · Updated 10 months ago
martius-lab / dominic
Code for our ICRA 2024 paper on learning diverse skills
☆26 · Apr 6, 2024 · Updated last year
marc-rigter / polygrad-world-models
Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024
☆74 · Mar 22, 2024 · Updated last year
inboxedshoe / RP-DQN
☆11 · Jan 11, 2022 · Updated 4 years ago
holken / polite
Code for polite
☆11 · Feb 28, 2024 · Updated 2 years ago
gkswamy98 / causal_il
Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…
☆11 · Dec 9, 2022 · Updated 3 years ago