Code implementation of our ICCV 2025 paper: On Large Multimodal Models as Open-World Image Classifiers
☆26 · Dec 4, 2025 · Updated 3 months ago
Alternatives and similar repositories for lmms-owc
Users who are interested in lmms-owc are comparing it to the libraries listed below.
- Official implementation of the CVPR '25 highlight paper "Compositional Caching for Training-free Open-vocabulary Attribute Detection" ☆23 · Dec 23, 2024 · Updated last year
- [CVPR '23 Highlight] Official repository for the paper "Quantum Multi-Model Fitting". ☆11 · Mar 7, 2025 · Updated last year
- Official Implementation of MULTI-LANE (Multi Label class incremental learning via summarising pAtch tokeN Embeddings). Published in 3rd C… ☆15 · Feb 20, 2025 · Updated last year
- [CVPR '24] Official implementation of the paper "Multiflow: Shifting Towards Task-Agnostic Vision-Language Pruning". ☆23 · Mar 7, 2025 · Updated last year
- [CVPR '25] Official implementation of the paper "Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages", CVPR 2025. ☆30 · Mar 30, 2025 · Updated 11 months ago
- [NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!! ☆61 · Mar 24, 2025 · Updated 11 months ago
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024 ☆70 · Sep 11, 2024 · Updated last year
- Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification ☆107 · Feb 2, 2024 · Updated 2 years ago
- Official implementation of "ConViS-Bench: Estimating Video Similarity Through Semantic Concepts", NeurIPS 2025 ☆25 · Nov 28, 2025 · Updated 3 months ago
- [CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models ☆26 · Feb 13, 2025 · Updated last year
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification ☆11 · Nov 15, 2023 · Updated 2 years ago
- [ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection" ☆14 · Feb 15, 2025 · Updated last year
- This is an implementation of the paper "Are We Done with Object-Centric Learning?" ☆12 · Sep 11, 2025 · Updated 5 months ago
- [ICPR 2024] Exemplar-free continual deepfake detector that leverages CLIP and domain-specific multi-modal prompts ☆15 · Aug 1, 2024 · Updated last year
- Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video' ☆18 · Mar 19, 2025 · Updated 11 months ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions ☆17 · Apr 4, 2024 · Updated last year
- ☆21 · Dec 15, 2025 · Updated 2 months ago
- [WACV'23] Mixture Outlier Exposure for Out-of-Distribution Detection in Fine-grained Environments ☆27 · Apr 12, 2023 · Updated 2 years ago
- Official implementation of "Harnessing Large Language Models for Training-free Video Anomaly Detection", CVPR 2024 ☆132 · Jul 15, 2024 · Updated last year
- ☆18 · Jun 10, 2025 · Updated 8 months ago
- PyTorch implementation of our ECCV 2022 paper "Rethinking Confidence Calibration for Failure Prediction" ☆26 · Jun 10, 2023 · Updated 2 years ago
- PyTorch code and pretrained weights for the UNIC models. ☆44 · Aug 29, 2024 · Updated last year
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning ☆39 · Aug 16, 2023 · Updated 2 years ago
- [ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models. ☆99 · Oct 20, 2025 · Updated 4 months ago
- Thesis Template ☆10 · Updated this week
- CoMA: Compositional Human Motion Generation with Multi-modal Agents ☆14 · Jul 31, 2025 · Updated 7 months ago
- Official Implementation of paper: PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery… ☆49 · Mar 18, 2023 · Updated 2 years ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models" ☆28 · Sep 18, 2025 · Updated 5 months ago
- [CVPR 2024 Highlight] ImageNet-D ☆47 · Oct 15, 2024 · Updated last year
- The Official Repository for CVPR2023 Paper "NICO++: Towards Better Benchmarking for Domain Generalization". ☆41 · Jul 29, 2023 · Updated 2 years ago
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data ☆46 · Oct 15, 2023 · Updated 2 years ago
- Code for paper "Rethinking Text-based Protein Understanding: Retrieval or LLM?" ☆18 · Oct 7, 2025 · Updated 5 months ago
- ☆12 · Apr 13, 2019 · Updated 6 years ago
- [ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu… ☆17 · Jun 22, 2022 · Updated 3 years ago
- The Koudai48 VOD Manager ☆10 · May 2, 2019 · Updated 6 years ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference ☆10 · Dec 15, 2024 · Updated last year
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners. ☆20 · Sep 24, 2025 · Updated 5 months ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023 ☆11 · Oct 5, 2023 · Updated 2 years ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding ☆19 · Mar 2, 2025 · Updated last year