ly-geming / video2tasks
Video2Tasks: Split multi-task robot videos into single-task segments with auto-generated instruction labels for VLA (pi0, OpenVLA) training
☆40Feb 6, 2026Updated last week
Alternatives and similar repositories for video2tasks
Users that are interested in video2tasks are comparing it to the libraries listed below
- ☆12Aug 8, 2024Updated last year
- Implementation of a Gaussian process regression for motion prediction in target-tracking scenarios (currently under development). Based o…☆11Dec 16, 2019Updated 6 years ago
- [NeurIPS 2025] LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents☆27Updated this week
- ☆14Feb 13, 2025Updated last year
- ☆10Apr 22, 2025Updated 9 months ago
- Subscribes to image messages published by Loomo and processes them☆10Oct 22, 2017Updated 8 years ago
- [TVCG 2024] Official implementation of "JIMR: Joint Semantic and Geometry Learning for Point Scene Instance Mesh Reconstruction"☆14Jan 7, 2026Updated last month
- ☆13Apr 3, 2025Updated 10 months ago
- I GAVE GPT-4 EYES!☆14Jan 24, 2024Updated 2 years ago
- Image stitching and 3D point cloud registration using a Kinect camera☆11Sep 9, 2020Updated 5 years ago
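- A tool that summarizes WeChat conversations and generates statistics such as distribution charts and word clouds, with support for AI summaries☆16May 6, 2024Updated last year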
- A desktop blink detection and reminder tool written in Python that reminds you to blink N times per minute☆12Feb 17, 2021Updated 4 years ago
- Generalized SLAM for Monocular Endoscopy based on Tracking any Point☆18May 21, 2025Updated 8 months ago
- ☆14Feb 25, 2023Updated 2 years ago
- Code for [AAAI 2026] AffordDex: Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors☆25Dec 26, 2025Updated last month
- DreamGaussian with 2D-GS☆12Oct 10, 2024Updated last year
- CVPR 2025, EchoMatch: Partial-to-Partial Shape Matching via Correspondence Reflection☆13Jul 29, 2025Updated 6 months ago
- Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research☆12Oct 22, 2021Updated 4 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆18Apr 16, 2024Updated last year
- ☆14Jul 6, 2025Updated 7 months ago
- PiloTY: AI pilot for PTY operations via MCP - enables AI agents to control interactive terminals like a human☆25Feb 6, 2026Updated last week
- HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model☆46Dec 11, 2025Updated 2 months ago
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation☆27Jul 18, 2025Updated 6 months ago
- Awesome-Text2Motion-Generation☆18Oct 26, 2023Updated 2 years ago
- Learning-Based Efficient Approximation of Data-Enabled Predictive Control☆15Mar 29, 2024Updated last year
- Code for "StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model", AAAI2026 Oral☆42Jan 16, 2026Updated 3 weeks ago
- [IJCV 2023] The official repo for “Learning Geometric Transformation for Point Cloud Completion”☆17Jul 11, 2023Updated 2 years ago
- ☆124Nov 1, 2025Updated 3 months ago
- [Arxiv'25] DINO-Tok: Adapting DINO for Visual Tokenizers☆35Nov 25, 2025Updated 2 months ago
- A benchmark dataset in Realistic Urban settings for Multi-Agent Anomaly Detection. Including code for the dataset, the baseline models an…☆16Aug 8, 2025Updated 6 months ago
- A visual analysis tool to support a unified model evaluation for different computer vision tasks, including classification, object detect…☆16Dec 5, 2023Updated 2 years ago
- Human-centered Delivery Benchmark☆20Jul 24, 2024Updated last year
- [INFFUS 2025] CoreNet: Conflict Resolution Network for point-pixel misalignment and sub-task suppression of 3D LiDAR-camera object detect…☆23Mar 15, 2025Updated 10 months ago
- frequency cam: detecting and visualizing frequencies with an event based camera☆22Dec 8, 2025Updated 2 months ago
- 🏠 [JBHI 2024] Pytorch implementation of 'MonoLoT: Self-Supervised Monocular Depth Estimation in Low-Texture Scenes for Automatic Robotic…☆21Mar 23, 2025Updated 10 months ago
- Endoscopy Specular Reflection Removal☆20May 31, 2024Updated last year
- A list of robotics related papers accepted by ICLR'25☆25Aug 28, 2025Updated 5 months ago
- [RA-L'22] Proactive Anomaly Detection for Robot Navigation with Multi-Sensor Fusion☆19Nov 9, 2022Updated 3 years ago
- yolo-pose for training escalator data☆16Jul 1, 2024Updated last year