fengyuli-dev / distribution-normalizationLinks

Test-Time Distribution Normalization For Contrastively Learned Vision-language Models

☆27

Alternatives and similar repositories for distribution-normalization

Users that are interested in distribution-normalization are comparing it to the libraries listed below

Sorting:

jonkahana / CLIPPR
An official PyTorch implementation for CLIPPR
☆29Updated 2 years ago
mshukor / eP-ALM
[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.
☆27Updated 2 years ago
codezakh / LilT
[ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning
☆40Updated 2 years ago
syp2ysy / prompt-SelF
[TIP] Exploring Effective Factors for Improving Visual In-Context Learning
☆19Updated 4 months ago
ExplainableML / WaffleCLIP
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…
☆61Updated 2 years ago
Hritikbansal / generative-robustness
Create generated datasets and train robust classifiers
☆36Updated 2 years ago
ExplainableML / fomo_in_flux
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
☆60Updated 11 months ago
showlab / datacentric.vlp
Compress conventional Vision-Language Pre-training data
☆52Updated 2 years ago
k1rezaei / Text-to-concept
☆35Updated last year
hammoudhasan / DiversitySSL
Original code base for On Pretraining Data Diversity for Self-Supervised Learning
☆14Updated 11 months ago
facebookresearch / genecis
Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"
☆61Updated 2 years ago
jochemloedeman / PGN
Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M…
☆42Updated last year
yuhui-zh15 / drml
Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)
☆34Updated 2 years ago
BatsResearch / ex2
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17Updated last year
boyazeng / understand_bias
Code release for "Understanding Bias in Large-Scale Visual Datasets"
☆22Updated 11 months ago
alinlab / b2t
Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation
☆31Updated 2 years ago
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11Updated 2 years ago
TencentARC / pi-Tuning
Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.
☆33Updated 2 years ago
james-oldfield / muMoE
[NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
☆37Updated last year
kdariina / CLIP-not-BoW-unimodally
Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"
☆16Updated 9 months ago
orrzohar / LOVM
[NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection
☆21Updated last year
lisadunlap / ALIA
Augmenting with Language-guided Image Augmentation (ALIA)
☆80Updated 2 years ago
arijitray1993 / COLA
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25Updated last year
mlfoundations / clip_quality_not_quantity
☆29Updated 3 years ago
McGill-NLP / diffusion-itm
Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"
☆33Updated last year
mwalmer-umd / vit_analysis
☆35Updated 2 years ago
naver-ai / prolip
☆55Updated 3 months ago
ExplainableML / ImageSelect
Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"
☆27Updated 2 years ago
RAIVNLab / CREPE
[CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?
☆35Updated 2 years ago
brendel-group / clip-ood
Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)
☆10Updated last year