[ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap
☆12Jun 18, 2025Updated 9 months ago
Alternatives and similar repositories for I0T
Users who are interested in I0T are comparing it to the libraries listed below.
- ☆29Aug 25, 2024Updated last year
- ☆14Dec 31, 2024Updated last year
- [ICLR 2025] Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion☆62Nov 30, 2025Updated 3 months ago
- This is the official code for the EMNLP findings 2025 paper "Enhancing Time Awareness in Generative Recommendation".☆17Aug 30, 2025Updated 6 months ago
- ☆10May 22, 2019Updated 6 years ago
- Up-to-date Vision Language Models collection. Mainly focused on computer vision☆19Feb 9, 2023Updated 3 years ago
- Enhanced version of WikiExtractor: a Wikipedia dumps extractor☆28Sep 17, 2025Updated 6 months ago
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 2 years ago
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆40Aug 14, 2023Updated 2 years ago
- ☆19Mar 4, 2024Updated 2 years ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆33Mar 15, 2024Updated 2 years ago
- A Pytorch implementation of "Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare To…☆10Apr 20, 2022Updated 3 years ago
- NumPy implementation of deep learning models, including Transformer (with 6 exercises)☆12Feb 24, 2024Updated 2 years ago
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Jan 23, 2024Updated 2 years ago
- ☆10Apr 7, 2024Updated last year
- Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)☆24Jan 17, 2026Updated 2 months ago
- Boostcamp AI Tech 3rd / Basic Paper reading w.r.t Embedding☆13Jun 1, 2022Updated 3 years ago
- ☆13Sep 8, 2024Updated last year
- ☆14Oct 14, 2019Updated 6 years ago
- Korean Benchmark for Korean Legal Language Understanding☆18Nov 16, 2024Updated last year
- ☆13Oct 4, 2023Updated 2 years ago
- dreamgonfly's blog☆11Sep 23, 2021Updated 4 years ago
- Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)☆28Jun 21, 2024Updated last year
- [ECCV 2024] Robust-Wide: Robust Watermarking against Instruction-driven Image Editing (Official Implementation)☆34May 30, 2025Updated 9 months ago
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Apr 6, 2023Updated 2 years ago
- ☆21Feb 8, 2025Updated last year
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆37Nov 12, 2025Updated 4 months ago
- ☆13Apr 10, 2025Updated 11 months ago
- NegCLIP.☆39Feb 6, 2023Updated 3 years ago
- ☆13Jul 1, 2024Updated last year
- ☆16Apr 22, 2021Updated 4 years ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation☆24Nov 17, 2025Updated 4 months ago
- Welcome to the official repository of AC-LORA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs, a mechanism that provides tr…☆20Nov 14, 2025Updated 4 months ago
- ☆19Jul 9, 2024Updated last year
- ☆19Dec 13, 2023Updated 2 years ago
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆45Oct 15, 2023Updated 2 years ago
- Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024☆108Jun 26, 2025Updated 8 months ago
- ☆17May 31, 2023Updated 2 years ago