TIGER-AI-Lab/ABC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TIGER-AI-Lab/ABC)

TIGER-AI-Lab / ABC

ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]

☆19

Alternatives and similar repositories for ABC

Users that are interested in ABC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chs20 / fuselip
View on GitHub
FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens
☆17Sep 8, 2025Updated 10 months ago
haon-chen / mmE5
View on GitHub
☆59Feb 27, 2025Updated last year
XMUDeepLIT / LLaVE
View on GitHub
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
☆78May 23, 2025Updated last year
TIGER-AI-Lab / VideoEval-Pro
View on GitHub
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]
☆15Jun 1, 2026Updated last month
arubique / OCCAM
View on GitHub
This is an implementation of the paper "Are We Done with Object-Centric Learning?"
☆13Jun 21, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
TIGER-AI-Lab / QuickVideo
View on GitHub
Quick Long Video Understanding [TMLR2025]
☆79Oct 27, 2025Updated 8 months ago
TIGER-AI-Lab / VLM2Vec
View on GitHub
This repo contains the code for "VLM2Vec / MMEB" [ICLR 2025], "VLM2Vec-V2 / MMEB-V2" [TMLR 2026], and "MMEB-V3" [COLM 2026]
☆667Jun 24, 2026Updated 3 weeks ago
deepglint / UniME
View on GitHub
[ACM MM 2025] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"
☆105Dec 8, 2025Updated 7 months ago
deepglint / Victor
View on GitHub
ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs
☆29Aug 15, 2025Updated 11 months ago
zifuwanggg / Jigsaw-R1
View on GitHub
[TMLR 2025] Jigsaw-R1: A Study of Rule-based Visual Reinforcement Learning with Jigsaw Puzzles
☆15Oct 17, 2025Updated 9 months ago
YanNeu / DASH
View on GitHub
DASH: Detection and Assessment of Systematic Hallucinations of VLMs
☆15Jul 2, 2025Updated last year
microsoft / clarification-qgen-globalinfo
View on GitHub
☆15Apr 29, 2021Updated 5 years ago
fsndzomga / open_source_lrm
View on GitHub
☆10Oct 24, 2024Updated last year
xiaoxing2001 / DeGLA
View on GitHub
[ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]
☆16Jul 15, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OpenGVLab / VKnowU
View on GitHub
[ECCV 2026] VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs
☆15Feb 3, 2026Updated 5 months ago
multimodal-art-projection / IV-Bench
View on GitHub
☆14Apr 23, 2025Updated last year
linkangheng / Video-UTR
View on GitHub
[ICLR2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs
☆61Feb 27, 2025Updated last year
zsweet / zsw_AI_model
View on GitHub
☆12Sep 25, 2018Updated 7 years ago
raghavlite / B3
View on GitHub
☆43Jan 12, 2026Updated 6 months ago
szacho / pointcam
View on GitHub
Self-supervised adversarial masking for point clouds
☆11Jul 12, 2023Updated 3 years ago
aaronserianni / attention-iou
View on GitHub
[CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps
☆13Mar 26, 2025Updated last year
THU-KEG / LongWriter-V
View on GitHub
[ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
☆24Mar 29, 2025Updated last year
ytaek-oh / fsc-clip
View on GitHub
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
☆22Oct 8, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ExplainableML / DeLoRA
View on GitHub
[ICLR25] Official Implementation of "Decoupling Angles and Strength in Low-rank Adaptation"
☆15Dec 12, 2025Updated 7 months ago
wuw2019 / LoTLIP
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
☆49Jan 14, 2025Updated last year
haon-chen / MoCa
View on GitHub
☆68Aug 14, 2025Updated 11 months ago
chinmay5 / vesselformer
View on GitHub
☆14Jul 8, 2023Updated 3 years ago
kaist-cvml / I-HallA-v1.0
View on GitHub
[AAAI 2025] Official Implementation of I-HallA v1.0
☆16Feb 2, 2025Updated last year
hrishioa / meeting-diary
View on GitHub
Simple meeting diarization and speaker id assistant for meetings.
☆12Feb 10, 2025Updated last year
XWalways / Papers
View on GitHub
Reading Papers
☆14Mar 26, 2021Updated 5 years ago
TIGER-AI-Lab / UniIR
View on GitHub
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)
☆183Oct 1, 2024Updated last year
haonan3 / V1
View on GitHub
V1: Toward Multimodal Reasoning by Designing Auxiliary Task
☆36Apr 14, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
g-luo / vlm_cross_modal_reps
View on GitHub
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆34May 1, 2025Updated last year
ExplainableML / cosmos
View on GitHub
[CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
☆42Mar 27, 2025Updated last year
thuwyh / BAAI-2020-CrowdHuman-Baseline
View on GitHub
IterDet: Iterative Scheme for Object Detection in Crowded Environments
☆11Jul 7, 2020Updated 6 years ago
flrneha / ElasticBasisForSpectralMatching
View on GitHub
Accompanying code for "An Elastic Basis for Spectral Shape Correspondence"
☆12Aug 2, 2023Updated 2 years ago
GaryGuTC / LaPA_model
View on GitHub
[CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering
☆27Apr 24, 2025Updated last year
mwbini / ether
View on GitHub
[ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections"
☆16May 31, 2024Updated 2 years ago
riccardomarin / Diff-FMAPs-PyTorch
View on GitHub
☆16May 11, 2022Updated 4 years ago