Video2Tasks: Split multi-task robot videos into single-task segments with auto-generated instruction labels for VLA (pi0, OpenVLA) training
☆66Feb 28, 2026Updated 3 months ago
Alternatives and similar repositories for video2tasks
Users who are interested in video2tasks are comparing it to the libraries listed below.
- Learning-Based Efficient Approximation of Data-Enabled Predictive Control☆16Mar 29, 2024Updated 2 years ago
- PiloTY: AI pilot for PTY operations via MCP - enables AI agents to control interactive terminals like a human☆38Apr 23, 2026Updated last month
- [CVPR 2026] HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model☆67Mar 11, 2026Updated 3 months ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆19Apr 16, 2024Updated 2 years ago
- ☆14Jul 6, 2025Updated 11 months ago
- CrossGen: Learning and Generating Cross Fields for Quad Meshing☆24Mar 25, 2026Updated 2 months ago
- ☆12Aug 8, 2024Updated last year
- ☆10Apr 22, 2025Updated last year
- DreamGaussian with 2D-GS☆12Oct 10, 2024Updated last year
- [TVCG 2024] Official implementation of "JIMR: Joint Semantic and Geometry Learning for Point Scene Instance Mesh Reconstruction"☆15Jan 7, 2026Updated 5 months ago
- Multi-Resolution POMDP Planning for Multi-Object Search in 3D (IROS 2021) | IROS RoboCup Best Paper Award☆10May 15, 2025Updated last year
- A list of robotics related papers accepted by ICLR'25☆25Aug 28, 2025Updated 9 months ago
- Awesome-Text2Motion-Generation☆18Oct 26, 2023Updated 2 years ago
- A wrapped package for Data-enabled predictive control (DeePC) implementation. Including DeePC and Robust DeePC design with multiple objec…☆26Dec 9, 2024Updated last year
- I GAVE GPT-4 EYES!☆15Jan 24, 2024Updated 2 years ago
- An OpenClaw skill that can generate paper search-review-critique expert-agent relevant to specific topics (we use Scientific ML and 3D geo…☆237Mar 11, 2026Updated 3 months ago
- Code for [AAAI 2026] AffordDex: Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors☆34Dec 26, 2025Updated 5 months ago
- ☆18Apr 3, 2025Updated last year
- Subscribe Loomo published image messages and process☆10Oct 22, 2017Updated 8 years ago
- Generalized SLAM for Monocular Endoscopy based on Tracking any Point☆21May 21, 2025Updated last year
- A lightweight toolkit for quantitatively scoring LeRobot episodes.☆70Mar 13, 2026Updated 3 months ago
- ☆15Feb 13, 2025Updated last year
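- A tool that summarizes WeChat conversations, generating statistics such as distribution charts and word clouds, with support for AI summaries☆16May 6, 2024Updated 2 years ago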
- 4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration. Accepted to NeurIPS 2025.☆54Jan 10, 2026Updated 5 months ago
- CVPR 2025, EchoMatch: Partial-to-Partial Shape Matching via Correspondence Reflection☆18Jul 29, 2025Updated 10 months ago
- [IJCV 2023] The official repo for “Learning Geometric Transformation for Point Cloud Completion”☆17Jul 11, 2023Updated 2 years ago
- 🏠 [JBHI 2024] Pytorch implementation of 'MonoLoT: Self-Supervised Monocular Depth Estimation in Low-Texture Scenes for Automatic Robotic…☆21Mar 23, 2025Updated last year
- Code for "StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model", AAAI2026 Oral☆56Jan 16, 2026Updated 5 months ago
- yolo-pose for training escalator data☆16Jul 1, 2024Updated last year
- Implementation of a Gaussian process regression for motion prediction in target-tracking scenarios (currently under development). Based o…☆11Dec 16, 2019Updated 6 years ago
- [NeurIPS 2025] LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents☆36Updated this week
- ☆14Feb 25, 2023Updated 3 years ago
- Human-centered Delivery Benchmark☆20Jul 24, 2024Updated last year
- ☆26Jul 15, 2024Updated last year
- Endoscopy Specular Reflection Removal☆23May 31, 2024Updated 2 years ago
- Official implementation of ICCV 2025 paper "TACO: Taming Diffusion for in-the-wild Video Amodal Completion"☆29Jul 4, 2025Updated 11 months ago
- Realistic endoscopic illumination modelling for NeRF-based data generation☆24May 10, 2025Updated last year
- A benchmark dataset in Realistic Urban settings for Multi-Agent Anomaly Detection. Including code for the dataset, the baseline models an…☆17Aug 8, 2025Updated 10 months ago
- Image stitching and 3D point cloud registration using a Kinect camera☆11Apr 20, 2026Updated last month