mazpie / genrlLinks
[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning them into sequences of latent world model states. Latent state sequences can be decoded using the decoder of the model, allowing visualization of the expected behavior, before training the agent to execute it.
☆86Updated 10 months ago
Alternatives and similar repositories for genrl
Users that are interested in genrl are comparing it to the libraries listed below
Sorting:
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆140Updated last year
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆134Updated last year
- PWM: Policy Learning with Large World Models☆65Updated 6 months ago
- ☆83Updated last month
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆78Updated last year
- Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)☆130Updated 2 years ago
- ☆67Updated last year
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Updated 2 years ago
- Finetuning Offline World Models in the Real World☆65Updated 2 years ago
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)☆30Updated 11 months ago
- ☆35Updated 8 months ago
- Masked World Models for Visual Control☆135Updated 2 years ago
- ☆60Updated 2 years ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆116Updated last year
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆78Updated last year
- ☆47Updated 2 years ago
- MiniGrid Implementation of BEHAVIOR Tasks☆58Updated 4 months ago
- ☆31Updated last year
- ☆46Updated 2 years ago
- (ICLR 2024) Reverse Forward Curriculum Learning☆52Updated last year
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆74Updated 8 months ago
- Chain-of-Thought Predictive Control☆57Updated 2 years ago
- Official repository for "VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training"☆179Updated 2 years ago
- Official implementation of DEMO3☆65Updated 6 months ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆82Updated last year
- The official implementations of Intention-conditioned Flow Occupancy Models (InFOM)☆29Updated last month
- ☆84Updated 8 months ago
- The official implementation of "Horizon Reduction Makes RL Scalable"☆181Updated 6 months ago
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024]☆106Updated last year
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents☆59Updated 11 months ago