[ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap
☆12Jun 18, 2025Updated 8 months ago
Alternatives and similar repositories for I0T
Users that are interested in I0T are comparing it to the libraries listed below
Sorting:
- ☆14Dec 31, 2024Updated last year
- ☆19Mar 4, 2024Updated 2 years ago
- Up-to-date Vision Language Models collection. Mainly focus on computer vision☆19Feb 9, 2023Updated 3 years ago
- <혼자 만들면서 공부하는 파이썬> 책의 깃허브 자료실☆15Jan 14, 2026Updated last month
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion☆62Nov 30, 2025Updated 3 months ago
- ☆28Aug 25, 2024Updated last year
- ☆10May 22, 2019Updated 6 years ago
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Apr 6, 2023Updated 2 years ago
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 2 years ago
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆34Nov 12, 2025Updated 3 months ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆33Mar 15, 2024Updated last year
- Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)☆28Jun 21, 2024Updated last year
- NegCLIP.☆39Feb 6, 2023Updated 3 years ago
- Improving Continuous Sign Language Recognition with Adapted Image Models☆14Nov 10, 2025Updated 3 months ago
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding☆44Nov 30, 2024Updated last year
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 7 months ago
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated last month
- 한국어 소설 텍스트를 위한 자연어처리 라이브러리입니다. Natural Language Processing Library for Korean Literary Text. (Will be open in February, 2024)☆11Jan 16, 2024Updated 2 years ago
- Mastering-NLP-from-Foundations-to-LLMs☆10Apr 11, 2025Updated 10 months ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- 깃헙에 NLP 잔디심기 시즌 5☆10Aug 19, 2024Updated last year
- ☆10Apr 7, 2024Updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆40Aug 14, 2023Updated 2 years ago
- AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)☆59Mar 1, 2025Updated last year
- ☆46Oct 27, 2023Updated 2 years ago
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆46Oct 15, 2023Updated 2 years ago
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- 《7가지 프로젝트로 배우는 LLM AI 에이전트 개발》 추가 지원 저장소☆15Apr 1, 2025Updated 11 months ago
- Welcome to the official repository of AC-LORA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs, a mechanism that provides tr…☆20Nov 14, 2025Updated 3 months ago
- Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"☆55Mar 28, 2024Updated last year
- ☆10Mar 30, 2023Updated 2 years ago
- Ranking-Consistent Language-Image Pretraining☆12Oct 24, 2025Updated 4 months ago
- ☆11Aug 7, 2025Updated 6 months ago
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Jan 23, 2024Updated 2 years ago
- ☆13Sep 8, 2024Updated last year
- ☆11Sep 1, 2024Updated last year
- [ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization☆12Oct 8, 2024Updated last year
- This is the official code for the EMNLP findings 2025 paper "Enhancing Time Awareness in Generative Recommendation".☆17Aug 30, 2025Updated 6 months ago
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 2 years ago