jiaming-zhou / X-ICM
Official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM, a cross-task in-context manipulation (VLA) method
☆34Updated last month
Alternatives and similar repositories for X-ICM
Users who are interested in X-ICM are comparing it to the repositories listed below
- ICCV2025☆108Updated last week
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos☆117Updated 2 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆208Updated 3 weeks ago
- [ICCV2025] AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation☆79Updated last month
- Official implementation of GR-MG☆85Updated 6 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆134Updated 3 months ago
- SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆180Updated last month
- ☆53Updated 7 months ago
- Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.☆175Updated last week
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆266Updated last month
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆60Updated 7 months ago
- Official codebase for "Any-point Trajectory Modeling for Policy Learning"☆236Updated last month
- A comprehensive list of resources on dual-system VLA models, including papers, code, and related websites.☆59Updated last week
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆248Updated last week
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆95Updated 3 months ago
- Official Code For VLA-OS.☆61Updated last month
- [ICCV 2025] Dense Policy: Bidirectional Autoregressive Learning of Actions --DSP☆49Updated last month
- A list of robotics related papers accepted by ICLR'25☆20Updated 5 months ago
- Reimplementation of GR-1, a generalized policy for robotics manipulation.☆139Updated 10 months ago
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"☆158Updated 2 months ago
- 🦾 A Dual-System VLA with System2 Thinking☆75Updated 2 weeks ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation (CVPR 2024)☆139Updated last year
- [ICML 2025] Rethinking Latent Redundancy in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation☆29Updated 2 months ago
- ☆60Updated last month
- An example RLDS dataset builder for X-embodiment dataset conversion.☆25Updated 5 months ago
- [CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface☆134Updated 9 months ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆340Updated 6 months ago
- Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets☆103Updated this week
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆261Updated last month
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)☆85Updated last year