[ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM
☆20May 22, 2025Updated 9 months ago
Alternatives and similar repositories for ProxyV
Users that are interested in ProxyV are comparing it to the libraries listed below
- ☆21Feb 13, 2026Updated 3 weeks ago
- A local AI assistant running on your device. It turns your files into actionable memory.☆54Feb 15, 2026Updated 2 weeks ago
- Toolbox for GTA-Human Datasets☆25Oct 9, 2024Updated last year
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Feb 27, 2025Updated last year
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Jul 25, 2023Updated 2 years ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆111Sep 19, 2025Updated 5 months ago
- NTU SC2002 Group Project - Final Year Project Management System (FYPMS)☆18Aug 12, 2025Updated 6 months ago
- ☆27Oct 5, 2023Updated 2 years ago
- A framework that allows you to apply Sparse AutoEncoder on any models☆51Jul 11, 2025Updated 7 months ago
- ☆77May 4, 2025Updated 10 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- [CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆74Feb 27, 2026Updated last week
- NEO Series: Native Vision-Language Models from First Principles☆654Feb 21, 2026Updated 2 weeks ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆53Jul 23, 2025Updated 7 months ago
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆79Apr 15, 2024Updated last year
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆115Oct 7, 2025Updated 5 months ago
- [CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation☆61Jul 8, 2025Updated 7 months ago
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆141Aug 21, 2025Updated 6 months ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 5 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆408Aug 26, 2025Updated 6 months ago
- Code for CineScale, higher-resolution video generation based on Wan☆184Aug 25, 2025Updated 6 months ago
- Dataset Quantization with Active Learning based Adaptive Sampling [ECCV 2024]☆10Jul 9, 2024Updated last year
- Starter template for building a JS HTML chatbot☆10Mar 21, 2024Updated last year
- ☆16Oct 13, 2025Updated 4 months ago
- ☆22Nov 18, 2025Updated 3 months ago
- Deep learning framework in Rust and Python☆10Dec 18, 2025Updated 2 months ago
- Nankai University College of Cyber Science, Principles of Computer Organization, Spring 2023☆13Jan 22, 2024Updated 2 years ago
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.☆10Jul 15, 2020Updated 5 years ago
- Code for paper "Rethinking Text-based Protein Understanding: Retrieval or LLM?"☆18Oct 7, 2025Updated 5 months ago
- [ECCV 2024] Characterizing Robustness via Natural Input Gradients☆13Updated this week
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆65Jul 8, 2025Updated 7 months ago
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 3 years ago
- Home page for Microsoft Phi-Ground tech-report☆23Sep 8, 2025Updated 5 months ago
- Official repository for Scone (Subject-driven Composition and Distinction Enhancement) model, designed to support multi-subject compositi…☆28Jan 14, 2026Updated last month
- Reversi AI based on Monte Carlo search algorithm☆10Apr 2, 2025Updated 11 months ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆19Mar 2, 2025Updated last year
- Shared platform for ChatGPT/Claude/Midjourney/API/Grok with Linux DO support (frontend project)☆13Apr 30, 2025Updated 10 months ago
- FunQA benchmarks funny, creative, and magic videos for challenging tasks including timestamp localization, video description, reasoning, …☆104Dec 25, 2025Updated 2 months ago