thuml / iVideoGPTLinks
Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223
☆133Updated 2 weeks ago
Alternatives and similar repositories for iVideoGPT
Users that are interested in iVideoGPT are comparing it to the libraries listed below
Sorting:
- Latent Motion Token as the Bridging Language for Robot Manipulation☆89Updated 3 weeks ago
- ☆99Updated 2 weeks ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆66Updated 5 months ago
- ☆46Updated 5 months ago
- ☆87Updated 3 weeks ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆199Updated 2 months ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆216Updated last year
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆129Updated last month
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆48Updated last month
- [ICML 2024] A Touch, Vision, and Language Dataset for Multimodal Alignment☆78Updated this week
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆64Updated 8 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆78Updated last month
- ☆72Updated 9 months ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆293Updated 4 months ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆85Updated last week
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆80Updated last week
- Official implementation of "Self-Improving Video Generation"☆66Updated last month
- Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.☆96Updated 2 weeks ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆104Updated 11 months ago
- ☆41Updated 7 months ago
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆14Updated 4 months ago
- A comprehensive list of papers about dual-system VLA models, including papers, codes, and related websites.☆30Updated 3 weeks ago
- Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets☆82Updated last month
- RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints☆48Updated last week
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆55Updated 5 months ago
- Official implementation of GR-MG☆80Updated 4 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆227Updated last week
- Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory☆157Updated last week
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆124Updated 5 months ago
- ☆76Updated last week