[T-PAMI 2025] EMOv2: Pushing 5M Vision Model Frontier
☆54Dec 30, 2024Updated last year
Alternatives and similar repositories for EMOv2
Users that are interested in EMOv2 are comparing it to the libraries listed below
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models☆125Oct 14, 2025Updated 4 months ago
- [NeurIPS 2025] UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions☆85Jul 14, 2025Updated 7 months ago
- SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation☆120Oct 18, 2024Updated last year
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling☆31Feb 6, 2026Updated last month
- XmodelLM☆38Nov 19, 2024Updated last year
- [CVPR25] Official implementation of "MobileMamba: Lightweight Multi-Receptive Visual Mamba Network"☆357Mar 20, 2025Updated 11 months ago
- [NeurIPS 2025] The official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tun…☆40Feb 20, 2025Updated last year
- [NeurIPS 2024] Official code release for the paper "Revisiting the Integration of Convolution and Attention for Vision Backbone"☆43Jan 21, 2025Updated last year
- ☆30Jan 18, 2026Updated last month
- [CVPR 2026] Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation☆55Dec 16, 2025Updated 2 months ago
- Official implementation of the paper "M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding"☆21Jan 14, 2026Updated last month
- Technical Challenge Repository for Visual Anomaly Detection Workshop (VAND) at CVPR☆13Jul 21, 2025Updated 7 months ago
- ☆12Oct 7, 2024Updated last year
- Code release for AccDiffusionV2 (TPAMI)☆35Nov 4, 2025Updated 4 months ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Jan 16, 2025Updated last year
- Official PyTorch implementation of WPS from our paper: WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models☆14Jun 12, 2025Updated 8 months ago
- [CVPR25] IAR☆17Jun 13, 2025Updated 8 months ago
- Official implementation for P2SAM (ACM MM 2024)☆14Dec 7, 2024Updated last year
- Daily log of my entire study journey☆15Jan 30, 2025Updated last year
- KV cache compression via sparse coding☆17Oct 26, 2025Updated 4 months ago
- Official PyTorch implementation of the paper "Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model" (NeurIPS'21)☆17Jun 12, 2022Updated 3 years ago
- Evaluation Tool for Anomaly Detection Research☆16May 9, 2024Updated last year
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆21Jun 15, 2025Updated 8 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 9 months ago
- ☆35Nov 25, 2025Updated 3 months ago
- Code for the paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Nov 29, 2023Updated 2 years ago
- ☆17Apr 9, 2025Updated 11 months ago
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts☆17Mar 11, 2025Updated 11 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- Friends of OLMo and their links.☆358Sep 15, 2025Updated 5 months ago
- [ICLR 26] Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow☆36Oct 3, 2025Updated 5 months ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 10 months ago
- ☆19Mar 25, 2025Updated 11 months ago
- ☆17Aug 7, 2024Updated last year
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching☆48Jul 17, 2025Updated 7 months ago
- RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations☆19Oct 13, 2025Updated 4 months ago
- ☆36Dec 16, 2025Updated 2 months ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Nov 12, 2024Updated last year
- [TII 2025] Official Implementation and Dataset Release for "Center-aware Residual Anomaly Synthesis for Multi-class Industrial Anomaly De…☆30Oct 5, 2025Updated 5 months ago