francescotonini / al-gtd
Official repo of the paper “AL-GTD: Deep Active Learning for Gaze Target Detection” (ACMMM2024)
☆12 Updated 4 months ago
Alternatives and similar repositories for al-gtd:
Users who are interested in al-gtd are comparing it to the repositories listed below.
- Official Implementation of MULTI-LANE (Multi Label class incremental learning via summarising pAtch tokeN Embeddings). Published in 3rd C… ☆12 Updated last month
- Official implementation of the CVPR '25 highlight paper "Compositional Caching for Training-free Open-vocabulary Attribute Detection" ☆11 Updated 3 months ago
- [NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!! ☆43 Updated 3 weeks ago
- Code implementation of our paper: On Large Multimodal Models as Open-World Image Classifiers ☆15 Updated 3 weeks ago
- Official Repository of "On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers" (Visual Continual Learnin… ☆8 Updated last year
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization ☆106 Updated last year
- Pytorch implementation of "Diversified in-domain synthesis with efficient fine-tuning for few-shot classification" ☆16 Updated last year
- [CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo… ☆47 Updated 7 months ago
- Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations" ☆36 Updated 5 months ago
- Compress conventional Vision-Language Pre-training data ☆49 Updated last year
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024 ☆53 Updated 7 months ago
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies. ☆49 Updated 5 months ago
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models ☆159 Updated last year
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts… ☆56 Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023) ☆38 Updated last year
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models. ☆76 Updated 8 months ago
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples ☆23 Updated 4 months ago
- LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections (NeurIPS 2023) ☆28 Updated last year
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition" ☆78 Updated 3 months ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23] ☆98 Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally? ☆32 Updated last year
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23) ☆36 Updated last year
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt… ☆39 Updated 3 months ago
- Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts" ☆99 Updated last year
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?' (ICLR2024) ☆13 Updated last year