WangWenhao0716 / TransHP
[NeurIPS 2023] The official implementation of "TransHP: Image Classification with Hierarchical Prompting"
☆21Updated last year
Alternatives and similar repositories for TransHP
Users who are interested in TransHP are comparing it to the repositories listed below.
- An official implementation of "GOAL⚽: Global-local Object Alignment Learning" (CVPR 2025).☆21Updated last month
- Official PyTorch repository for GRAM☆95Updated 5 months ago
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆89Updated 8 months ago
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆68Updated 7 months ago
- [ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling☆74Updated last year
- [2025 CVPR] Towards Open-Vocabulary Audio-Visual Event Localization☆29Updated 7 months ago
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆92Updated last year
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)☆57Updated last year
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆123Updated 2 years ago
- Official Repository for CVPR 2024 Paper: "Large Language Models are Good Prompt Learners for Low-Shot Image Classification"☆38Updated last year
- The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning☆17Updated 7 months ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners☆103Updated 2 years ago
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆54Updated 2 years ago
- [AAAI 2024] XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning.☆14Updated last year
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…☆39Updated 5 months ago
- ☆52Updated 3 months ago
- [CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning"☆108Updated 4 months ago
- ICLR'24 Official Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆73Updated last year
- Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.☆48Updated 2 months ago
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆171Updated last year
- Question-Aware Gaussian Experts for Audio-Visual Question Answering -- Official Pytorch Implementation (CVPR'25, Highlight)☆23Updated 4 months ago
- (CVPR2024) MeaCap: Memory-Augmented Zero-shot Image Captioning☆50Updated last year
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆277Updated 2 years ago
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval☆59Updated last year
- ☆27Updated 2 years ago
- The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024☆47Updated 10 months ago
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"☆33Updated 11 months ago
- ☆36Updated 2 years ago
- ☆75Updated last year
- (CVPR2024 Highlight) Novel Class Discovery for Ultra-Fine-Grained Visual Categorization (UFG-NCD)☆21Updated last year