marco-garosi/ComCa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/marco-garosi/ComCa)

marco-garosi / ComCa

Official implementation of the CVPR '25 highlight paper "Compositional Caching for Training-free Open-vocabulary Attribute Detection"

☆23

Alternatives and similar repositories for ComCa

Users that are interested in ComCa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

benedettaliberatori / convisbench
View on GitHub
Official implementation of "ConViS-Bench: Estimating Video Similarity Through Semantic Concepts", NeurIPS 2025
☆27Nov 28, 2025Updated 8 months ago
Moreno98 / UWM
View on GitHub
[WACV 26] Official code for the paper Safe Vision-Language Models via Unsafe Weights Manipulation
☆16Mar 3, 2026Updated 4 months ago
francescotonini / al-gtd
View on GitHub
Official repo of the paper “AL-GTD: Deep Active Learning for Gaze Target Detection” (ACMMM2024)
☆12Updated this week
tdemin16 / proactivebench
View on GitHub
Official repository of "ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models" (ECCV 2026)
☆29Jun 22, 2026Updated last month
tdemin16 / multi-lane
View on GitHub
Official Implementation of MULTI-LANE (Multi Label class incremental learning via summarising pAtch tokeN Embeddings). Published in 3rd C…
☆15Feb 20, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
FarinaMatteo / qmmf
View on GitHub
[CVPR '23 Highlight] Official repository for the paper "Quantum Multi-Model Fitting".
☆11Mar 7, 2025Updated last year
altndrr / lmms-owc
View on GitHub
Code implementation of our ICCV 2025 paper: On Large Multimodal Models as Open-World Image Classifiers
☆27Dec 4, 2025Updated 7 months ago
Deepayan137 / R2P
View on GitHub
Official codebase for the paper "Training-Free Personalization via Retrieval and Reasoning on Fingerprints"
☆25Nov 6, 2025Updated 8 months ago
laitifranz / MemCoach
View on GitHub
[CVPR'26 Highlight] MemCoach: Steering-based MLLM for Actionable Image Memorability Feedback
☆42Updated this week
BerasiDavide / vlm_image_compositionality
View on GitHub
[CVPR'25] Official implementation of the paper "Not Only Text: Exploring Compositionality of Visual Representations in Vision-Language Mo…
☆18Nov 21, 2025Updated 8 months ago
marco-garosi / CIRCLE
View on GitHub
[CVPR Findings 2026] Large Multimodal Models as General In-Context Classifiers
☆24Mar 1, 2026Updated 4 months ago
benedettaliberatori / T3AL
View on GitHub
Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024
☆75Sep 11, 2024Updated last year
Picsart-AI-Research / OpenBias
View on GitHub
[CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
☆26Feb 13, 2025Updated last year
FarinaMatteo / multiflow
View on GitHub
[CVPR '24] Official implementation of the paper "Multiflow: Shifting Towards Task-Agnostic Vision-Language Pruning".
☆24Mar 7, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
FarinaMatteo / rethinking_fewshot_vlms
View on GitHub
[CVPR '25] Official implementation of the paper "Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages", CVPR 2025.
☆33Mar 30, 2025Updated last year
lorenzovaquero / BUSCA
View on GitHub
[ECCV 2024] BUSCA: "Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking"
☆44Dec 6, 2024Updated last year
hanweikung / nullface
View on GitHub
[FG 2026] Official implementation of the paper "NullFace: Training-Free Localized Face Anonymization"
☆29Apr 28, 2026Updated 3 months ago
zchoi / SPT
View on GitHub
[TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".
☆10Aug 14, 2024Updated last year
laitifranz / Prompt2Guard
View on GitHub
[ICPR 2024] Exemplar-free continual deepfake detector that leverages CLIP and domain-specific multi-modal prompts
☆15Aug 1, 2024Updated last year
FarinaMatteo / zero
View on GitHub
[NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!!
☆64Mar 24, 2025Updated last year
zipengxuc / StylerDALLE
View on GitHub
Code for ICCV 2023 paper ✨ "StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Mo…
☆18Jan 25, 2024Updated 2 years ago
altndrr / vic
View on GitHub
Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification
☆107Feb 2, 2024Updated 2 years ago
quhongyu / ClusPro
View on GitHub
[ICLR 2025] Official repository of "Learning Clustering-based Prototypes for Compositional Zero-shot Learning"
☆25Feb 3, 2026Updated 5 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
vturrisi / disef
View on GitHub
Pytorch implementation of "Diversified in-domain synthesis with efficient fine-tuning for few-shot classification"
☆17Mar 25, 2024Updated 2 years ago
Markus-Pobitzer / wlp
View on GitHub
Loomis Painter: Reconstructing the painting process
☆55Nov 24, 2025Updated 8 months ago
mlfoundations / dcvlm
View on GitHub
☆54Updated this week
LancasterLi / RefSAM
View on GitHub
☆28Oct 31, 2024Updated last year
41xu / DEMO
View on GitHub
[3DV 2026] Dense Motion Captioning
☆35Jan 28, 2026Updated 6 months ago
mazumder-lab / CHITA
View on GitHub
Code for Fast as CHITA: Neural Network Pruning with Combinatorial Optimization
☆14Aug 2, 2023Updated 2 years ago
suny-sht / clip-red-circle
View on GitHub
Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023
☆12Sep 21, 2023Updated 2 years ago
Amazingren / MIRAGE
View on GitHub
(ICLR2026) Efficient Degradation-agnostic Image Restoration via Channel-Wise Functional Decomposition and Manifold Regularization
☆35Jun 23, 2026Updated last month
open-retina / open-retina
View on GitHub
Collaborative retina modelling across datasets and species.
☆20Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lucazanella / lavad
View on GitHub
Official implementation of "Harnessing Large Language Models for Training-free Video Anomaly Detection", CVPR 2024
☆149Jul 15, 2024Updated 2 years ago
Mia-YatingYu / STDD
View on GitHub
[AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP
☆23Aug 5, 2025Updated 11 months ago
marco-garosi / COPS
View on GitHub
Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"
☆25Jun 8, 2025Updated last year
zchoi / VCRN
View on GitHub
☆11Jul 11, 2023Updated 3 years ago
YangYY-Liu / MatrixChatGPTVoiceBot
View on GitHub
Talk to ChatGPT and Generate image via any Matrix client!
☆16Apr 25, 2023Updated 3 years ago
FocoosAI / focoos
View on GitHub
🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.…
☆352Dec 11, 2025Updated 7 months ago
FabrizioSandri / 2SSP
View on GitHub
2SSP: A Two-Stage Framework for Structured Pruning of LLMs
☆21Aug 18, 2025Updated 11 months ago