AlignmentResearch / vlmrmLinks
☆65Updated last year
Alternatives and similar repositories for vlmrm
Users that are interested in vlmrm are comparing it to the libraries listed below
Sorting:
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆135Updated last year
- Masked World Models for Visual Control☆131Updated 2 years ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆128Updated last year
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆84Updated 7 months ago
- ☆46Updated last year
- PWM: Policy Learning with Large World Models☆58Updated 3 months ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆46Updated last year
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value☆35Updated last year
- ☆24Updated last year
- MiniGrid Implementation of BEHAVIOR Tasks☆56Updated last month
- Chain-of-Thought Predictive Control☆58Updated 2 years ago
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Updated last year
- Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)☆126Updated 2 years ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆159Updated 2 years ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆76Updated last year
- Finetuning Offline World Models in the Real World☆62Updated 2 years ago
- ☆45Updated 2 years ago
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆44Updated 8 months ago
- [NeurIPS 2022] Official implementation of the paper: "Human-AI Shared Control via Policy Dissection"☆50Updated 2 years ago
- ☆71Updated 3 years ago
- Code for "MetaMorph: Learning Universal Controllers with Transformers", Gupta et al, ICLR 2022☆125Updated 3 years ago
- ☆28Updated last year
- ☆35Updated 5 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆78Updated 2 years ago
- Code for Reinforcement Learning from Vision Language Foundation Model Feedback☆126Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Updated 2 years ago
- From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data☆55Updated 2 years ago
- Transformer-based World Models☆86Updated 2 years ago
- ☆48Updated last year