Code implementation of our ICCV 2025 paper: On Large Multimodal Models as Open-World Image Classifiers
☆27 Dec 4, 2025 Updated 6 months ago
Alternatives and similar repositories for lmms-owc
Users who are interested in lmms-owc are comparing it to the libraries listed below.
- [CVPR '23 Highlight] Official repository for the paper "Quantum Multi-Model Fitting". ☆11 Mar 7, 2025 Updated last year
- Official implementation of the CVPR '25 highlight paper "Compositional Caching for Training-free Open-vocabulary Attribute Detection" ☆23 Dec 23, 2024 Updated last year
- Official Implementation of MULTI-LANE (Multi Label class incremental learning via summarising pAtch tokeN Embeddings). Published in 3rd C… ☆15 Feb 20, 2025 Updated last year
- Official repo of the paper “AL-GTD: Deep Active Learning for Gaze Target Detection” (ACMMM2024) ☆12 Nov 29, 2024 Updated last year
- Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification ☆107 Feb 2, 2024 Updated 2 years ago
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024 ☆74 Sep 11, 2024 Updated last year
- [ECCV 2024] BUSCA: "Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking" ☆44 Dec 6, 2024 Updated last year
- [NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!! ☆63 Mar 24, 2025 Updated last year
- [CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models ☆26 Feb 13, 2025 Updated last year
- Code implementation of our BMVC 2022 paper: Cluster-level pseudo-labelling for source-free cross-domain facial expression recognition ☆11 Dec 18, 2022 Updated 3 years ago
- [ICPR 2024] Exemplar-free continual deepfake detector that leverages CLIP and domain-specific multi-modal prompts ☆15 Aug 1, 2024 Updated last year
- Official implementation of "ConViS-Bench: Estimating Video Similarity Through Semantic Concepts", NeurIPS 2025 ☆27 Nov 28, 2025 Updated 6 months ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data ☆13 Sep 30, 2023 Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?" ☆13 Jun 8, 2026 Updated last week
- [CVPR-25🔥] Test-time Counterattacks (TTC) towards adversarial robustness of CLIP ☆42 Jun 4, 2025 Updated last year
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification ☆11 Nov 15, 2023 Updated 2 years ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models. ☆15 Feb 1, 2025 Updated last year
- Collaborative retina modelling across datasets and species. ☆20 Apr 10, 2026 Updated 2 months ago
- Code for BYOP [CVPR 2023] ☆12 Sep 25, 2023 Updated 2 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions ☆17 Apr 4, 2024 Updated 2 years ago
- 2SSP: A Two-Stage Framework for Structured Pruning of LLMs ☆21 Aug 18, 2025 Updated 9 months ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners" ☆29 Apr 27, 2024 Updated 2 years ago
- [ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models. ☆102 Oct 20, 2025 Updated 7 months ago
- Code for Fast as CHITA: Neural Network Pruning with Combinatorial Optimization ☆14 Aug 2, 2023 Updated 2 years ago
- [WACV'23] Mixture Outlier Exposure for Out-of-Distribution Detection in Fine-grained Environments ☆26 Apr 12, 2023 Updated 3 years ago
- a parody of the ever-increasing amount of papers that appear on arXiv ☆38 May 31, 2026 Updated 2 weeks ago
- Code implementation of the paper 'FIction: 4D Future Interaction Prediction from Video' ☆20 Mar 19, 2025 Updated last year
- PHASE annotations for societal bias in vision-and-language tasks. ☆18 Jun 18, 2024 Updated last year
- [ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron ☆32 Apr 30, 2025 Updated last year
- [CVPR 2023, top-10%] Authors official PyTorch implementation of the "Attribute-preserving Face Dataset Anonymization via Latent Code Opti… ☆77 Aug 8, 2024 Updated last year
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning ☆28 Sep 27, 2024 Updated last year
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation… ☆13 Dec 23, 2023 Updated 2 years ago
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning ☆41 Aug 16, 2023 Updated 2 years ago
- Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC… ☆18 Feb 12, 2025 Updated last year
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026) ☆43 Mar 8, 2026 Updated 3 months ago
- (Pattern Recognition 2025) Towards Trustworthy Dataset Distillation ☆14 Dec 8, 2024 Updated last year
- ☆15 Apr 25, 2025 Updated last year
- PyTorch code and pretrained weights for the UNIC models. ☆45 Aug 29, 2024 Updated last year
- [IEEE TMM 2023] This is the official repo of the paper "Perceptual Quality Improvement in Videoconferencing using Keyframes-based GAN". ☆17 Dec 10, 2024 Updated last year