Video2Tasks: Split multi-task robot videos into single-task segments with auto-generated instruction labels for VLA (pi0, OpenVLA) training
☆49Feb 28, 2026Updated 3 weeks ago
Alternatives and similar repositories for video2tasks
Users that are interested in video2tasks are comparing it to the libraries listed below.
- [CVPR 2026] HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model☆50Mar 11, 2026Updated 2 weeks ago
- Learning-Based Efficient Approximation of Data-Enabled Predictive Control☆15Mar 29, 2024Updated last year
- PiloTY: AI pilot for PTY operations via MCP - enables AI agents to control interactive terminals like a human☆30Mar 11, 2026Updated 2 weeks ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆19Apr 16, 2024Updated last year
- ☆14Jul 6, 2025Updated 8 months ago
- ☆12Aug 8, 2024Updated last year
- ☆10Apr 22, 2025Updated 11 months ago
- DreamGaussian with 2D-GS☆12Oct 10, 2024Updated last year
- [TVCG 2024] Official implementation of "JIMR: Joint Semantic and Geometry Learning for Point Scene Instance Mesh Reconstruction"☆14Jan 7, 2026Updated 2 months ago
- A list of robotics related papers accepted by ICLR'25☆25Aug 28, 2025Updated 6 months ago
- Awesome-Text2Motion-Generation☆18Oct 26, 2023Updated 2 years ago
- Code for [AAAI 2026] AffordDex: Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors☆28Dec 26, 2025Updated 3 months ago
- An OpenClaw skill that can generate paper search-review-critique expert agents relevant to specific topics (we use Scientific ML and 3D geo…☆186Mar 11, 2026Updated 2 weeks ago
- A wrapped package for Data-enabled predictive control (DeePC) implementation. Including DeePC and Robust DeePC design with multiple objec…☆24Dec 9, 2024Updated last year
- I GAVE GPT-4 EYES!☆14Jan 24, 2024Updated 2 years ago
- ☆14Apr 3, 2025Updated 11 months ago
- Subscribes to image messages published by Loomo and processes them☆10Oct 22, 2017Updated 8 years ago
- Generalized SLAM for Monocular Endoscopy based on Tracking any Point☆20May 21, 2025Updated 10 months ago
- ☆14Feb 13, 2025Updated last year
- CVPR 2025, EchoMatch: Partial-to-Partial Shape Matching via Correspondence Reflection☆13Jul 29, 2025Updated 7 months ago
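- A tool that summarizes WeChat conversations, generating statistics such as distribution charts and word clouds, with support for AI summaries☆16May 6, 2024Updated last year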
- 4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration. Accepted to NeurIPS 2025.☆51Jan 10, 2026Updated 2 months ago
- [IJCV 2023] The official repo for “Learning Geometric Transformation for Point Cloud Completion”☆17Jul 11, 2023Updated 2 years ago
- 🏠 [JBHI 2024] Pytorch implementation of 'MonoLoT: Self-Supervised Monocular Depth Estimation in Low-Texture Scenes for Automatic Robotic…☆21Mar 23, 2025Updated last year
- Code for "StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model", AAAI2026 Oral☆48Jan 16, 2026Updated 2 months ago
- yolo-pose for training escalator data☆16Jul 1, 2024Updated last year
- Implementation of a Gaussian process regression for motion prediction in target-tracking scenarios (currently under development). Based o…☆11Dec 16, 2019Updated 6 years ago
- [NeurIPS 2025] LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents☆31Updated this week
- ☆14Feb 25, 2023Updated 3 years ago
- Human-centered Delivery Benchmark☆20Jul 24, 2024Updated last year
- ☆27Jul 15, 2024Updated last year
- [Arxiv'25] DINO-Tok: Adapting DINO for Visual Tokenizers☆35Nov 25, 2025Updated 4 months ago
- Official implementation of ICCV 2025 paper "TACO: Taming Diffusion for in-the-wild Video Amodal Completion"☆28Jul 4, 2025Updated 8 months ago
- Endoscopy Specular Reflection Removal☆21May 31, 2024Updated last year
- A benchmark dataset in Realistic Urban settings for Multi-Agent Anomaly Detection. Including code for the dataset, the baseline models an…☆16Aug 8, 2025Updated 7 months ago
- Realistic endoscopic illumination modelling for NeRF-based data generation☆24May 10, 2025Updated 10 months ago
- Image stitching and 3D point cloud registration using a Kinect camera☆11Sep 9, 2020Updated 5 years ago
- [ICCV'2025]: GAP: Gaussianize Any Point Clouds with Text Guidance☆54Nov 6, 2025Updated 4 months ago
- ☆135Nov 1, 2025Updated 4 months ago