AlignmentResearch / vlmrm
☆37Updated 2 months ago
Related projects: ⓘ
- [ICLR 2023] Choreographer: a model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able to effi…☆34Updated 3 months ago
- PWM: Policy Learning with Large World Models☆32Updated last month
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆71Updated 4 months ago
- ☆44Updated 7 months ago
- MoDem Accelerating Visual Model-Based Reinforcement Learning with Demonstrations☆82Updated last year
- From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data☆48Updated last year
- ☆28Updated 11 months ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆43Updated 6 months ago
- [GenRL] Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning them into sequenc…☆41Updated last month
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆31Updated 6 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆49Updated 11 months ago
- ☆69Updated 2 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆21Updated last year
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆81Updated 10 months ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆68Updated last month
- Chain-of-Thought Predictive Control☆54Updated last year
- Official release of CompoSuite, a compositional RL benchmark☆44Updated 7 months ago
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆29Updated 4 months ago
- ☆38Updated 10 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆71Updated 9 months ago
- ☆51Updated last year
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆29Updated last year
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆106Updated 7 months ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆52Updated 3 months ago
- Masked World Models for Visual Control☆114Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆59Updated 2 months ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.☆90Updated last year
- [NeurIPS 2022] Official implementation of the paper: "Human-AI Shared Control via Policy Dissection"☆48Updated last year
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆29Updated last year
- Finetuning Offline World Models in the Real World☆42Updated 10 months ago