marco-garosi / ComCaView external linksLinks
Official implementation of the CVPR '25 highlight paper "Compositional Caching for Training-free Open-vocabulary Attribute Detection"
☆23Dec 23, 2024Updated last year
Alternatives and similar repositories for ComCa
Users that are interested in ComCa are comparing it to the libraries listed below
Sorting:
- Official repo of the paper “AL-GTD: Deep Active Learning for Gaze Target Detection” (ACMMM2024)☆12Nov 29, 2024Updated last year
- Official Implementation of MULTI-LANE (Multi Label class incremental learning via summarising pAtch tokeN Embeddings). Published in 3rd C…☆14Feb 20, 2025Updated 11 months ago
- Code implementation of our ICCV 2025 paper: On Large Multimodal Models as Open-World Image Classifiers☆26Dec 4, 2025Updated 2 months ago
- Official implementation of "ConViS-Bench: Estimating Video Similarity Through Semantic Concepts", NeurIPS 2025☆25Nov 28, 2025Updated 2 months ago
- [CVPR '25] Official implementation of the paper "Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages", CVPR 2025.☆29Mar 30, 2025Updated 10 months ago
- [CVPR '24] Official implementation of the paper "Multiflow: Shifting Towards Task-Agnostic Vision-Language Pruning".☆23Mar 7, 2025Updated 11 months ago
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".☆10Aug 14, 2024Updated last year
- [CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models☆26Feb 13, 2025Updated last year
- [ICPR 2024] Exemplar-free continual deepfake detector that leverages CLIP and domain-specific multi-modal prompts☆15Aug 1, 2024Updated last year
- Code for ICCV 2023 paper ✨ "StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Mo…☆18Jan 25, 2024Updated 2 years ago
- ☆27Oct 31, 2024Updated last year
- Loomis Painter: Reconstructing the painting process☆52Nov 24, 2025Updated 2 months ago
- [NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!!☆60Mar 24, 2025Updated 10 months ago
- [CVPR-25🔥] Test-time Counterattacks (TTC) towards adversarial robustness of CLIP☆39Jun 4, 2025Updated 8 months ago
- A collection of awesome think with videos papers.☆89Dec 1, 2025Updated 2 months ago
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning☆13Apr 14, 2024Updated last year
- ☆13Aug 28, 2024Updated last year
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆14Jul 31, 2025Updated 6 months ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- Thesis Template☆10Jan 26, 2026Updated 3 weeks ago
- The Koudai48 VOD Manager☆10May 2, 2019Updated 6 years ago
- Official training code for MUG-V 10B video generation model. Built on Megatron-LM (v0.14.0) with production-ready distributed training fo…☆19Oct 20, 2025Updated 3 months ago
- ☆10Mar 31, 2025Updated 10 months ago
- This project is a demonstration of a content-based recommendation system for Spotify that leverages user's preferences and audio features…☆16Apr 4, 2023Updated 2 years ago
- Code implementation of our BMVC 2022 paper: Cluster-level pseudo-labelling for source-free cross-domain facial expression recognition☆11Dec 18, 2022Updated 3 years ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023☆11Oct 5, 2023Updated 2 years ago
- Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification☆107Feb 2, 2024Updated 2 years ago
- ☆11Jul 2, 2022Updated 3 years ago
- 用于自动预约民政局婚姻登记处的号,限广东省民政局☆10Jun 25, 2023Updated 2 years ago
- [ICCV 2025] Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction☆23Oct 1, 2025Updated 4 months ago
- 数模组新生入门手册——长期维护> <(使用GPL许可证 非商用授权 如果使用其中内容请表明出处)☆11Oct 11, 2019Updated 6 years ago
- The Pytorch implemetation of "FeatWalk: Enhancing Few-Shot Classification through Local View Leveraging", AAAI 2024.☆11Mar 4, 2024Updated last year
- Source codes for the paper "Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning" (PDMER) which p…☆14Mar 24, 2025Updated 10 months ago
- ☆14Jan 5, 2022Updated 4 years ago
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation☆23Dec 30, 2025Updated last month
- CVPR 2021 Oral Paper PatchGenCN☆12Oct 28, 2021Updated 4 years ago
- [CVPR 2025] Official implementation of SSP: High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Se…☆15Jun 26, 2025Updated 7 months ago
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)☆45Dec 5, 2024Updated last year
- ☆12Mar 30, 2023Updated 2 years ago