naver-ai / clip4dm

Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)

☆22

Alternatives and similar repositories for clip4dm

Users who are interested in clip4dm are comparing it to the repositories listed below.

naver-ai / hype
[ECCV 2024] Official PyTorch implementation of "HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts"
☆18 Updated 7 months ago
naver-ai / elva
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, …
☆16 Updated 7 months ago
naver-ai / prolip
☆51 Updated 4 months ago
miccunifi / Cross-the-Gap
[ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion
☆49 Updated 2 months ago
navervision / lincir
Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)
☆135 Updated 11 months ago
ldynx / SAVE
☆27 Updated 7 months ago
naver-ai / lut
[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"
☆12 Updated 7 months ago
yejipark-m / ConVis
[AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode…
☆18 Updated 9 months ago
naver-ai / eccv-caption
Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)
☆55 Updated last year
naver-ai / dawin
☆25 Updated this week
khanrc / tcl
Official implementation of TCL (CVPR 2023)
☆114 Updated 2 years ago
naver-ai / dap-cl
Official code of "Generating Instance-level Prompts for Rehearsal-free Continual Learning (ICCV 2023)"
☆42 Updated last year
aimagelab / pacscore
[CVPR 2023] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation
☆62 Updated 4 months ago
ys-zong / VL-ICL
[ICLR 2025] VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning
☆61 Updated 5 months ago
mbzuai-oryx / CVRR-Evaluation-Suite
[CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo…
☆48 Updated 10 months ago
naver-ai / cl-vs-mim
(ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?"
☆111 Updated last year
ExplainableML / WaffleCLIP
Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…
☆57 Updated 2 years ago
naver-ai / pcmepp
Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)
☆57 Updated last year
ant-research / DreamLIP
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
☆134 Updated 2 months ago
mlvlab / DAPT
Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)
☆41 Updated last year
saibr / hypvl
This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…
☆20 Updated last year
mrwu-mac / R-Bench
[ICML 2024] Repo for the paper "Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models"
☆21 Updated 6 months ago
Muhammad-Huzaifaa / ObjectCompose
[ACCV 2024] ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes 🚀🚀🚀
☆37 Updated 5 months ago
mjkmain / BOK-VQA
[AAAI2024] BOK-VQA : Bilingual Outside Knowledge-based Visual Question Answering via Graph Representation Pretraining
☆2 Updated last year
alinlab / s-clip
S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions
☆48 Updated 2 years ago
jameelhassan / PromptAlign
[NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization
☆106 Updated last year
Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking
[CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images.
☆28 Updated last year
changdaeoh / multimodal-mixup
Official implementation for NeurIPS'23 paper "Geodesic Multi-Modal Mixup for Robust Fine-Tuning"
☆34 Updated 9 months ago
naver-ai / NeglectedFreeLunch
☆38 Updated last year
Seonghoon-Yu / Pseudo-RIS
[ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"
☆18 Updated 9 months ago