mlvlab / SugaFormerLinks

Official Implementation (Pytorch) of "Super-class guided Transformer for Zero-Shot Attribute Classification", AAAI 2025

☆14

Alternatives and similar repositories for SugaFormer

Users that are interested in SugaFormer are comparing it to the libraries listed below

Sorting:

mlvlab / ProMetaR
Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".
☆27Updated 4 months ago
mlvlab / OVQA
Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…
☆17Updated last year
lezhang7 / SAIL
[CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"
☆46Updated last month
lixinustc / GraphAdapter
The efficient tuning method for VLMs
☆80Updated last year
mlvlab / RALF
Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".
☆41Updated 10 months ago
mlvlab / MELTR
MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)
☆33Updated last year
mlvlab / MCTF
Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".
☆39Updated last year
richard-peng-xia / HGCLIP
[COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding
☆40Updated 7 months ago
pumpkin805 / FALIP
[ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
☆14Updated 10 months ago
miccunifi / Cross-the-Gap
[ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion
☆49Updated 2 months ago
muzairkhattak / transformers-transforming-vision
Validating image classification benchmark results on ViTs and ResNets (v2)
☆12Updated 2 years ago
JoakimHaurum / TokenReduction
Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …
☆34Updated last year
naver-ai / lut
[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"
☆12Updated 7 months ago
mzhaoshuai / RLCF
[ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.
☆83Updated 11 months ago
ThomasWangY / 2024-AAAI-HPT
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)
☆73Updated 5 months ago
dogehhh / ReCLIP
Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation
☆48Updated last week
mlvlab / RPO
Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023
☆53Updated last year
YBZh / DMN
CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
☆79Updated last year
visresearch / mgc
The official implementation of paper: "Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning"
☆30Updated last year
xjjxmu / TextRefiner
The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]
☆40Updated 4 months ago
miccunifi / KDPL
[ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation
☆62Updated last week
renytek13 / Soft-Prompt-Generation
[ECCV 2024] Soft Prompt Generation for Domain Generalization
☆25Updated 9 months ago
callsys / GenPromp
[ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization
☆57Updated last year
hy0Y / ST-GT
[CVPR 2024] Official repository of ST_GT
☆9Updated 10 months ago
akhtarvision / cal-detr
☆42Updated last year
Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking
[CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.
☆28Updated last year
DavidYanAnDe / ARC
☆35Updated last year
JoakimHaurum / ATC
Official PyTorch implementation of Agglomerative Token Clustering presented at ECCV 2024
☆17Updated 10 months ago
runtsang / VFPT
Official implementation of NeurIPS 2024 "Visual Fourier Prompt Tuning"
☆31Updated 6 months ago
haoosz / ade-czsl
[CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning
☆39Updated last year