naver-ai / clip4dm
Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)
☆21 Updated 3 weeks ago
Alternatives and similar repositories for clip4dm
Users who are interested in clip4dm are comparing it to the libraries listed below
- [ECCV 2024] Official PyTorch implementation of "HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts" ☆18 Updated 5 months ago
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, … ☆16 Updated 4 months ago
- ☆50 Updated 2 months ago
- ☆24 Updated 2 months ago
- [AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode… ☆17 Updated 7 months ago
- ☆46 Updated last year
- Extended COCO Validation (ECCV) Caption dataset (ECCV 2022) ☆56 Updated last year
- ☆38 Updated last year
- ☆26 Updated 5 months ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT ☆55 Updated 9 months ago
- [ICLR 2023] RC-MAE ☆52 Updated last year
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners" ☆12 Updated 5 months ago
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025. ☆12 Updated last week
- [AAAI2024] BOK-VQA: Bilingual Outside Knowledge-based Visual Question Answering via Graph Representation Pretraining ☆1 Updated 10 months ago
- Official implementation for CVPR'23 paper "BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning" ☆111 Updated last year
- Official PyTorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024) ☆58 Updated 11 months ago
- Official PyTorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024) ☆133 Updated 9 months ago
- Official implementation of TCL (CVPR 2023) ☆111 Updated 2 years ago
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion ☆37 Updated 3 weeks ago
- read 1 paper everyday (only weekday) ☆56 Updated 3 years ago
- ☆38 Updated 11 months ago
- [ECCV-2022] Grounding Visual Representations with Texts for Domain Generalization ☆31 Updated 2 years ago
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation" ☆18 Updated 7 months ago
- Official PyTorch Implementation of Efficient and Versatile Robust Fine-Tuning of Zero-shot Models, ECCV 2024 ☆13 Updated 7 months ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts… ☆57 Updated last year
- [ICLR 2025] VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning ☆54 Updated 3 months ago
- Official code of "Generating Instance-level Prompts for Rehearsal-free Continual Learning (ICCV 2023)" ☆42 Updated last year
- ☆81 Updated 2 years ago
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023 ☆45 Updated 11 months ago
- Official Implementation of "The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers (ECCV 2024)" ☆22 Updated 3 months ago