deepglint / ALIPLinks

[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption

☆102

Alternatives and similar repositories for ALIP

Users that are interested in ALIP are comparing it to the libraries listed below

Sorting:

Paranioar / UniPT
[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"
☆67Updated last year
yangyangyang127 / APE
[ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"
☆149Updated last year
ylingfeng / FGVP
Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023
☆55Updated last year
vishaal27 / SuS-X
Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]
☆105Updated 2 years ago
ant-research / DreamLIP
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
☆136Updated 6 months ago
geekyutao / TaskRes
Task Residual for Tuning Vision-Language Models (CVPR 2023)
☆73Updated 2 years ago
wengzejia1 / Open-VCLIP
☆119Updated last year
SY-Xuan / Pink
Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs
☆95Updated 10 months ago
sail-sg / ptp
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
☆152Updated 2 years ago
wusize / F-LMM
[CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models
☆108Updated 6 months ago
bladewaltz1 / PromptSwitch
☆30Updated 2 years ago
yuhangzang / UPT
☆60Updated 7 months ago
bruceyo / V-PETL
Towards a Unified View on Visual Parameter-Efficient Transfer Learning
☆26Updated 3 years ago
linhuixiao / CLIP-VG
[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.
☆131Updated 3 weeks ago
SivanDoveh / TSVLC
Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models
☆47Updated 2 years ago
Liuziyu77 / RAR
The official implementation of RAR
☆92Updated last year
Monoxide-Chen / uncertainty_retrieval
ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization
☆73Updated last year
LijieFan / LaCLIP
[NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"
☆286Updated last year
TencentARC / TVTS
Turning to Video for Transcript Sorting
☆48Updated 2 years ago
Qinying-Liu / TagAlign
Official implementation of TagAlign
☆35Updated 11 months ago
prannaykaul / mm-ovod
Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection"
☆95Updated 2 years ago
lorebianchi98 / FG-OVD
[CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec…
☆61Updated 8 months ago
callsys / GenPromp
[ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization
☆57Updated 2 years ago
amazon-science / prompt-pretraining
Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"
☆259Updated last year
linyq2117 / TagCLIP
[AAAI 2024] TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training
☆104Updated last year
Koorye / DePT
[CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"
☆109Updated last week
sunanhe / MKT
Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".
☆129Updated last year
linziyi96 / st-adapter
☆84Updated 2 years ago
ZhangYuanhan-AI / visual_prompt_retrieval
[NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"
☆179Updated last year
palchenli / VL-Instruction-Tuning
☆91Updated 2 years ago