penghao-wu / ProxyVView external linksLinks
[ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM
☆20May 22, 2025Updated 8 months ago
Alternatives and similar repositories for ProxyV
Users that are interested in ProxyV are comparing it to the libraries listed below
Sorting:
- ☆21Dec 10, 2025Updated 2 months ago
- A local AI assistant running on your device. It turns your files into actionable memory.☆54Updated this week
- Toolbox for GTA-Human Datasets☆18Oct 9, 2024Updated last year
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Feb 27, 2025Updated 11 months ago
- Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos☆65Sep 5, 2025Updated 5 months ago
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Jul 25, 2023Updated 2 years ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆111Sep 19, 2025Updated 4 months ago
- VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆61Jan 9, 2026Updated last month
- NTU SC2002 Group Project - Final Year Project Management System (FYPMS)☆18Aug 12, 2025Updated 6 months ago
- A framework that allows you to apply Sparse AutoEncoder on any models☆51Jul 11, 2025Updated 7 months ago
- ☆77May 4, 2025Updated 9 months ago
- NEO Series: Native Vision-Language Models from First Principles☆643Jan 9, 2026Updated last month
- ☆18Jun 10, 2025Updated 8 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆52Jul 23, 2025Updated 6 months ago
- ☆40Mar 3, 2024Updated last year
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆79Apr 15, 2024Updated last year
- [CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation☆61Jul 8, 2025Updated 7 months ago
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆116Oct 7, 2025Updated 4 months ago
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆139Aug 21, 2025Updated 5 months ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 4 months ago
- Code for CineScale, higher-resolution video generation based on Wan☆183Aug 25, 2025Updated 5 months ago
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 3 years ago
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆65Jul 8, 2025Updated 7 months ago
- Code for paper "Rethinking Text-based Protein Understanding: Retrieval or LLM?"☆18Oct 7, 2025Updated 4 months ago
- Deep learning framework in Rust and Python☆10Dec 18, 2025Updated last month
- Official repository for Scone (Subject-driven Composition and Distinction Enhancement) model, designed to support multi-subject compositi…☆28Jan 14, 2026Updated last month
- ☆22Nov 18, 2025Updated 2 months ago
- ☆18Mar 2, 2025Updated 11 months ago
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation☆48Dec 11, 2024Updated last year
- Starter template for building a JS HTML chatbot☆10Mar 21, 2024Updated last year
- [ECCV 2024] Characterizing Robustness via Natural Input Gradients☆13Oct 14, 2024Updated last year
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.☆10Jul 15, 2020Updated 5 years ago
- 南开大学网络空间安全学院计算机组成原理2023spring☆13Jan 22, 2024Updated 2 years ago
- Home page for Microsoft Phi-Ground tech-report☆23Sep 8, 2025Updated 5 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- FunQA benchmarks funny, creative, and magic videos for challenging tasks including timestamp localization, video description, reasoning, …☆104Dec 25, 2025Updated last month
- On-Device Domain Generalization☆46Nov 9, 2022Updated 3 years ago
- [ICCV'25] FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model☆82Jul 24, 2025Updated 6 months ago