mazpie / genrl
[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning them into sequences of latent world model states. Latent state sequences can be decoded using the decoder of the model, allowing visualization of the expected behavior, before training the agent to execute it.
☆63Updated this week
Alternatives and similar repositories for genrl:
Users that are interested in genrl are comparing it to the libraries listed below
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆69Updated 7 months ago
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024]☆56Updated last month
- Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)☆95Updated last year
- PWM: Policy Learning with Large World Models☆39Updated 4 months ago
- ☆54Updated last year
- (ICLR 2024) Reverse Forward Curriculum Learning☆40Updated last month
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆85Updated last year
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents☆39Updated 2 months ago
- ☆45Updated 11 months ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆85Updated 5 months ago
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)☆25Updated last year
- Chain-of-Thought Predictive Control☆55Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆44Updated 10 months ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆88Updated 6 months ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆25Updated 3 months ago
- ☆43Updated last year
- ☆24Updated last year
- ☆46Updated 6 months ago
- ☆29Updated last year
- JAX implementation of WSRL and RL baselines☆18Updated this week
- Codebase for HiP☆88Updated last year
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"☆27Updated last month
- A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks☆61Updated 3 weeks ago
- ☆29Updated 8 months ago
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆119Updated 11 months ago
- [RA-L 2024] Novel action spaces leveraging redundancy in 7 DoF arms enable efficient & precise learning in robotic manipulation☆17Updated 7 months ago
- Official PyTorch implementation of AdaFlow☆45Updated 2 months ago
- From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data☆51Updated last year
- ☆69Updated 4 months ago
- code for the paper Imitation Learning from Observation with Automatic Discount Scheduling☆13Updated 9 months ago