alexlioralexli / attention-transferLinks

☆20

Alternatives and similar repositories for attention-transfer

Users that are interested in attention-transfer are comparing it to the libraries listed below

Sorting:

ZhengYu518 / VL-Mamba
Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"
☆80Updated last year
JiuTian-VL / MoME
[NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models
☆68Updated 2 months ago
zihuixue / MFH
[ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation
☆45Updated 2 years ago
Haochen-Wang409 / DropPos
[NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
☆61Updated last year
hunto / DiffKD
Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023
☆88Updated last year
sarahESL / AlignCLIP
AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)
☆42Updated 4 months ago
OpenSparseLLMs / CLIP-MoE
CLIP-MoE: Mixture of Experts for CLIP
☆42Updated 9 months ago
JieShibo / PETL-ViT
[ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass
☆192Updated last year
ZjjConan / VLM-MultiModalAdapter
The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".
☆73Updated 2 months ago
YingWANGG / M2IB
Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution
☆53Updated last year
mzhaoshuai / RLCF
[ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.
☆83Updated 11 months ago
OliverRensu / ARM
[ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision
☆82Updated last month
Koorye / DePT
[CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"
☆106Updated last month
SHI-Labs / Diffusion-Driven-Test-Time-Adaptation-via-Synthetic-Domain-Alignment
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment, arXiv 2024 / CVPR 2025
☆30Updated 4 months ago
lezhang7 / SAIL
[CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"
☆46Updated last month
CHENGY12 / PLOT
[ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models
☆167Updated last year
mlvlab / DAPT
Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)
☆41Updated last year
lloongx / DIKI
[ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models
☆50Updated last year
Cyang-Zhao / Grad-Eclip
☆44Updated 2 months ago
techmonsterwang / iLLaMA
Adapting LLaMA Decoder to Vision Transformer
☆28Updated last year
PalAvik / hycoclip
Code for the paper "Compositional Entailment Learning for Hyperbolic Vision-Language Models".
☆75Updated last month
mlvlab / ProMetaR
Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".
☆27Updated 4 months ago
mr-eggplant / FOA
Code for ICML 2024 paper (Oral) — Test-Time Model Adaptation with Only Forward Passes
☆81Updated 10 months ago
htyao89 / KgCoOp
☆96Updated last year
Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking
[CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.
☆28Updated last year
zefang-liu / AdaMoLE
AdaMoLE: Adaptive Mixture of LoRA Experts
☆33Updated 9 months ago
jameelhassan / PromptAlign
[NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization
☆106Updated last year
mlvlab / RPO
Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023
☆53Updated last year
meetdavidwan / crg
PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"
☆35Updated last year
mihirp1998 / Diffusion-TTA
Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.
☆74Updated last year