Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision
☆143Feb 6, 2026Updated last month
Alternatives and similar repositories for youtu-vl
Users that are interested in youtu-vl are comparing it to the libraries listed below
Sorting:
- DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data☆46Dec 12, 2025Updated 3 months ago
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆31Apr 20, 2025Updated 11 months ago
- The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs☆119Jul 1, 2025Updated 8 months ago
- ☆55Sep 21, 2025Updated 6 months ago
- An open source codebase for object detection based on Jittor☆19Dec 9, 2025Updated 3 months ago
- ☆14Apr 19, 2025Updated 11 months ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆29Apr 15, 2025Updated 11 months ago
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆21Feb 23, 2025Updated last year
- An official code for "A Decoupled Spatio-Temporal Framework for Skeleton-based Action Segmentation".☆38Dec 15, 2023Updated 2 years ago
- ASID-Caption: Attribute-Structured and Quality-Verified Audiovisual Instruction Dataset and Training Pipeline for Fine-Grained Video Unde…☆49Mar 3, 2026Updated 2 weeks ago
- Official Release of ACM TOG 2025 paper -- GS-ROR