cvlab-columbia / videopolicyLinks
☆38Updated 4 months ago
Alternatives and similar repositories for videopolicy
Users that are interested in videopolicy are comparing it to the libraries listed below
Sorting:
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆80Updated 11 months ago
- List of papers on video-centric robot learning☆22Updated last year
- F1: A Vision Language Action Model Bridging Understanding and Generation to Actions☆137Updated last month
- ☆87Updated last year
- Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos☆182Updated 3 months ago
- [ICCV2025] AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation☆93Updated 5 months ago
- [AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling☆63Updated last month
- InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation☆70Updated 2 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆303Updated 4 months ago
- ☆41Updated 5 months ago
- Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets☆160Updated 2 months ago
- ☆55Updated 3 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆67Updated 11 months ago
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos☆150Updated 2 months ago
- ☆135Updated 5 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆111Updated 7 months ago
- ICCV2025☆143Updated 3 weeks ago
- [ICCV 2025] Dense Policy: Bidirectional Autoregressive Learning of Actions #DSP☆71Updated last month
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆152Updated 8 months ago
- Official implementation of the paper: Task Reconstruction and Extrapolation for $\pi_0$ using Text Latent (https://arxiv.org/pdf/2505.035…