xfactlab/I0T

[ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap

☆12

Alternatives and similar repositories for I0T

Users that are interested in I0T are comparing it to the libraries listed below.

naver-ai / muco
Official PyTorch implementation of MuCo: Multi-turn Contrastive Learning for Multimodal Embedding Model (CVPR 2026)
☆15 · Apr 16, 2026 · Updated 3 months ago
taewhankim / VIPCAP
☆15 · Dec 31, 2024 · Updated last year
liyongqi67 / GRACE
☆29 · Aug 25, 2024 · Updated last year
modulabs / solutions4students
☆10 · May 22, 2019 · Updated 7 years ago
ysw1021 / AGG
A PyTorch implementation of "Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare To…
☆10 · Apr 20, 2022 · Updated 4 years ago
OpenSparseLLMs / CLIP-MoE
CLIP-MoE: Mixture of Experts for CLIP
☆58 · Oct 10, 2024 · Updated last year
Georgelingzj / up-to-date-Vision-Language-Models
Up-to-date collection of Vision Language Models, focused mainly on computer vision
☆20 · Feb 9, 2023 · Updated 3 years ago
sugyeonge / Towards-diverse-QAG
☆19 · Mar 4, 2024 · Updated 2 years ago
RyanLiut / awesome-diverse-captioning
Some papers about *diverse* image (a few videos) captioning
☆25 · Apr 4, 2023 · Updated 3 years ago
McGill-NLP / diffusion-itm
Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"
☆33 · Mar 15, 2024 · Updated 2 years ago
paulgavrikov / vlm_shapebias
Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).
☆31 · Jan 26, 2025 · Updated last year
xiami2019 / CLAIF
[Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback
☆40 · Aug 14, 2023 · Updated 2 years ago
agneet42 / revision
[ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"
☆13 · Aug 6, 2024 · Updated last year
naye971012 / numpy_transformer
NumPy implementation of deep learning models, including Transformer (with 6 exercises)
☆12 · Feb 24, 2024 · Updated 2 years ago
ghchen18 / acl23_mclip
The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'
☆10 · Jan 23, 2024 · Updated 2 years ago
duyngtr16061999 / KDMCSE
☆10 · Apr 7, 2024 · Updated 2 years ago
SSL-Sign-Language / Korean-Disaster-Safety-Information-Sign-Language-Translation-Benchmark-Dataset
☆21 · May 23, 2024 · Updated 2 years ago
sarahESL / AlignCLIP
AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)
☆67 · Mar 1, 2025 · Updated last year
miccunifi / Cross-the-Gap
[ICLR 2025] Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion
☆70 · Nov 30, 2025 · Updated 7 months ago
conghui1002 / DG-UCDIR
☆13 · Oct 4, 2023 · Updated 2 years ago
kaist-ami / AVHBench
[ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"
☆25 · Mar 8, 2026 · Updated 4 months ago
taehwakkwon / papago-translator
☆16 · Apr 22, 2021 · Updated 5 years ago
Wangt-CN / Code_CASC
☆14 · Oct 14, 2019 · Updated 6 years ago
kimcando / BoostcampAITech3-PaperReading-Embedding
Boostcamp AI Tech 3rd / Basic Paper reading w.r.t Embedding
☆13 · Jun 1, 2022 · Updated 4 years ago
dlawjddn803 / INFO
Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…
☆23 · Apr 6, 2023 · Updated 3 years ago
au-revoir / model-editing-ft
☆13 · Sep 8, 2024 · Updated last year
skleee / GRUT
This is the official code for the EMNLP findings 2025 paper "Enhancing Time Awareness in Generative Recommendation".
☆19 · May 24, 2026 · Updated 2 months ago
Show-han / Zeroshot_REC
Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)
☆28 · Jun 21, 2024 · Updated 2 years ago
vinid / neg_clip
NegCLIP.
☆41 · Feb 6, 2023 · Updated 3 years ago
wrudman / NOTICE
☆14 · Apr 10, 2025 · Updated last year
YennNing / CoFiRec
CoFiRec: Coarse-to-Fine Tokenization for Generative Recommendation
☆21 · Jan 23, 2026 · Updated 6 months ago
zilunzhang / StreetCLIP-Repoduce
☆13 · Jul 1, 2024 · Updated 2 years ago
StanfordMIMI / villa
[ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data
☆45 · Oct 15, 2023 · Updated 2 years ago
dreamgonfly / dreamgonfly.github.io
dreamgonfly's blog
☆10 · Sep 23, 2021 · Updated 4 years ago
GingL / CMPA
☆16 · May 31, 2023 · Updated 3 years ago
ry-eon / Bubble-Detector-YOLOv4
☆23 · Nov 4, 2021 · Updated 4 years ago
xinghaow99 / DenoSent
[AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
☆15 · Apr 29, 2024 · Updated 2 years ago
ZhangXu0963 / VSL
The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.
☆15 · Dec 25, 2023 · Updated 2 years ago
lunash0 / prometheus5_project_AIDrivingGuide
Project to provide driver guidance through object recognition in the vehicle driving environment: Display bounding boxes on objects in im…
☆20 · Aug 25, 2024 · Updated last year