jiaming-zhou / X-ICM
Official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM, a cross-task in-context manipulation (VLA) method
☆58Updated 2 months ago
Alternatives and similar repositories for X-ICM
Users that are interested in X-ICM are comparing it to the libraries listed below
- ICCV2025☆153Updated 2 months ago
- [ICCV2025] AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation☆97Updated 7 months ago
- [ICCV 2025] Dense Policy: Bidirectional Autoregressive Learning of Actions DSP☆72Updated 3 weeks ago
- [AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling☆87Updated last month
- [NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆225Updated 7 months ago
- Official implementation of GR-MG☆93Updated last year
- ☆71Updated 3 weeks ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation (CVPR 2024)☆146Updated last year
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆277Updated 7 months ago
- ☆146Updated last week
- ☆62Updated last year
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆115Updated 9 months ago
- Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization☆338Updated 2 weeks ago
- This is the official repo for [CVPR 2025] paper, Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipul…☆29Updated 10 months ago
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos☆162Updated 4 months ago
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"☆208Updated 8 months ago
- Official Code For VLA-OS.☆138Updated 7 months ago
- ☆47Updated 7 months ago
- H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation☆118Updated last month
- Official implementation of the paper: Task Reconstruction and Extrapolation for $\pi_0$ using Text Latent (https://arxiv.org/pdf/2505.035…☆102Updated 6 months ago
- [ICRA 2026] Official implementation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"☆48Updated last week
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆331Updated 6 months ago
- [CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface☆151Updated last year
- [RSS 2025] Gripper Keypose and Object Pointflow as Interfaces for Bimanual Robotic Manipulation☆76Updated 6 months ago
- Implementation of VLM4VLA☆115Updated last week
- Official codebase for "Any-point Trajectory Modeling for Policy Learning"☆271Updated 7 months ago
- ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation☆122Updated 9 months ago
- ☆55Updated 9 months ago
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation☆96Updated 2 weeks ago
- Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation"☆122Updated 5 months ago