[ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap
☆12Jun 18, 2025Updated 11 months ago
Alternatives and similar repositories for I0T
Users that are interested in I0T are comparing it to the libraries listed below.
- ☆29Aug 25, 2024Updated last year
- ☆14Dec 31, 2024Updated last year
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion☆65Nov 30, 2025Updated 5 months ago
- This is the official code for the EMNLP findings 2025 paper "Enhancing Time Awareness in Generative Recommendation".☆18Aug 30, 2025Updated 8 months ago
- ☆10May 22, 2019Updated 7 years ago
- Up-to-date Vision Language Models collection. Mainly focused on computer vision☆19Feb 9, 2023Updated 3 years ago
- Enhanced version of Wikiextrator: A Wikipedia dumps extractor☆28Sep 17, 2025Updated 8 months ago
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 3 years ago
- Official code for "Can We Talk Models Into Seeing the World Differently?" (ICLR 2025).☆30Jan 26, 2025Updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆40Aug 14, 2023Updated 2 years ago
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- ☆19Mar 4, 2024Updated 2 years ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆33Mar 15, 2024Updated 2 years ago
- A Pytorch implementation of "Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare To…☆10Apr 20, 2022Updated 4 years ago
- NumPy implementation of deep learning models including Transformer (with 6 exercises)☆12Feb 24, 2024Updated 2 years ago
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Jan 23, 2024Updated 2 years ago
- ☆10Apr 7, 2024Updated 2 years ago
- Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)☆26Jan 17, 2026Updated 4 months ago
- Boostcamp AI Tech 3rd / Basic Paper reading w.r.t Embedding☆13Jun 1, 2022Updated 3 years ago
- ☆14Oct 14, 2019Updated 6 years ago
- ☆13Sep 8, 2024Updated last year
- Korean Benchmark for Korean Legal Language Understanding☆19Nov 16, 2024Updated last year
- ☆13Oct 4, 2023Updated 2 years ago
- dreamgonfly's blog☆10Sep 23, 2021Updated 4 years ago
- Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)☆28Jun 21, 2024Updated last year
- ☆22Feb 8, 2025Updated last year
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Apr 6, 2023Updated 3 years ago
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆38Nov 12, 2025Updated 6 months ago
- ☆13Apr 10, 2025Updated last year
- [ECCV 2024] Robust-Wide: Robust Watermarking against Instruction-driven Image Editing (Official Implementation)☆36May 30, 2025Updated 11 months ago
- NegCLIP.☆41Feb 6, 2023Updated 3 years ago
- ☆13Jul 1, 2024Updated last year
- ☆16Apr 22, 2021Updated 5 years ago
- Welcome to the official repository of AC-LORA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs, a mechanism that provides tr…☆21Nov 14, 2025Updated 6 months ago
- ☆19Jul 9, 2024Updated last year
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆45Oct 15, 2023Updated 2 years ago
- ☆21Jun 4, 2025Updated 11 months ago
- ☆19Dec 13, 2023Updated 2 years ago
- Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024☆108Jun 26, 2025Updated 10 months ago