[ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM
☆20May 22, 2025Updated last year
Alternatives and similar repositories for ProxyV
Users that are interested in ProxyV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆77Apr 9, 2026Updated last month
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆25Apr 13, 2026Updated last month
- Toolbox for GTA-Human Datasets☆26Oct 9, 2024Updated last year
- A local AI assistant running on your device. It turns your files into actionable memory.☆55Mar 24, 2026Updated 2 months ago
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆54Feb 27, 2025Updated last year
- Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos☆70Sep 5, 2025Updated 8 months ago
- ☆27Oct 5, 2023Updated 2 years ago
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Jul 25, 2023Updated 2 years ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆114Sep 19, 2025Updated 8 months ago
- ☆79May 4, 2025Updated last year
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆151Aug 21, 2025Updated 9 months ago
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 3 years ago
- A framework that allows you to apply Sparse AutoEncoder on any models☆54Jul 11, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.☆10Jul 15, 2020Updated 5 years ago
- NEO Series: Native Vision-Language Models from First Principles☆748Apr 26, 2026Updated last month
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆79Apr 15, 2024Updated 2 years ago
- [ECCV 2024] Characterizing Robustness via Natural Input Gradients☆13Mar 6, 2026Updated 2 months ago
- NTU SC2002 Group Project - Final Year Project Management System (FYPMS)☆18Aug 12, 2025Updated 9 months ago
- You Only Condense Once: Two Rules for Pruning Condensed Datasets (NeurIPS 2023)☆16Nov 18, 2023Updated 2 years ago
- Cut2Next: Generating Next Shot via In-Context Tuning☆33Aug 21, 2025Updated 9 months ago
- Code for CineScale, higher-resolution video generation based on Wan☆186Aug 25, 2025Updated 9 months ago
- Portrait matting model for academic use only.☆13Jan 7, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation☆48Dec 11, 2024Updated last year
- ☆10Oct 9, 2022Updated 3 years ago
- [ACL 2026 Findings, ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆119Apr 8, 2026Updated last month
- [ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal…☆446Apr 7, 2026Updated last month
- (CVPR 2026 Highlight) Official repository for Scone (Subject-driven COmposition and DistinctioN Enhancement) model, supporting subject co…☆31Apr 9, 2026Updated last month
- [ICLR 2025] MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution☆329Jul 4, 2025Updated 10 months ago
- FunQA benchmarks funny, creative, and magic videos for challenging tasks including timestamp localization, video description, reasoning, …☆104Dec 25, 2025Updated 5 months ago
- Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding☆352Apr 14, 2026Updated last month
- Multi-Space Alignments Towards Universal LiDAR Segmentation☆54Oct 24, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆15Dec 30, 2025Updated 4 months ago
- [CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps☆13Mar 26, 2025Updated last year
- [ICLR 2025] Official repository for the paper "Influence-Guided Diffusion for Dataset Distillation".☆15Feb 12, 2025Updated last year
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners☆52Dec 4, 2025Updated 5 months ago
- Self-Supervised Dataset Distillation for Transfer Learning☆18Apr 10, 2024Updated 2 years ago
- [CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation☆63Jul 8, 2025Updated 10 months ago
- ☆20Jun 10, 2025Updated 11 months ago