dhg-wei/MCL

(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning

☆28

Alternatives and similar repositories for MCL

Users who are interested in MCL are comparing it to the repositories listed below.

iLearn-Lab / SIGIR24-FTI4CIR
Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval
☆27 · Apr 9, 2026 · Updated 3 months ago
iLearn-Lab / SIGIR24-DQU-CIR
[SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval
☆44 · Jul 14, 2024 · Updated 2 years ago
iLearn-Lab / TOIS25-Awesome-Composed-Image-Retrieval
Collection of Composed Image Retrieval (CIR) papers.
☆360 · Jun 8, 2026 · Updated last month
dhg-wei / TOPA
(NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
☆29 · Sep 27, 2024 · Updated last year
xianzhangzx / FINER-MLLM
The implementation of FINER-MLLM, accepted at MM2024.
☆18 · Oct 8, 2024 · Updated last year
VamosC / CoLearning-meet-StitchUp
[TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.
☆13 · Aug 19, 2023 · Updated 2 years ago
navervision / lincir
Official PyTorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)
☆148 · Jan 5, 2026 · Updated 6 months ago
icq-benchmark / icq-benchmark
☆19 · Jul 28, 2025 · Updated 11 months ago
chunmeifeng / SPRC
【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval
☆94 · Apr 16, 2024 · Updated 2 years ago
levymsn / ChatIR
Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"
☆33 · Feb 5, 2025 · Updated last year
Chiangsonw / CaLa
The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"
☆15 · Sep 19, 2024 · Updated last year
allenai / close
☆59 · Aug 30, 2023 · Updated 2 years ago
Cuberick-Orion / Candidate-Reranking-CIR
The official implementation for Candidate Set Re-ranking for Composed Image Retrieval (TMLR, 01/2024)
☆20 · Feb 7, 2024 · Updated 2 years ago
miccunifi / SEARLE
[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion
☆198 · Jul 31, 2025 · Updated 11 months ago
BetterZH / SEVLM-code
Training A Small Emotional Vision Language Model for Visual Art Comprehension
☆17 · Jul 26, 2024 · Updated last year
youngkyunJang / VDG
Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval (CVPR 2024)
☆21 · May 30, 2024 · Updated 2 years ago
BUAADreamer / SPN4CIR
[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
☆39 · Sep 9, 2025 · Updated 10 months ago
google-research / composed_image_retrieval
☆197 · Updated this week
yuexihang / DeltaPhi
Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving"
☆13 · Jun 17, 2024 · Updated 2 years ago
mikecheninoulu / Emotional-gesture-papers
☆23 · May 29, 2025 · Updated last year
sung-yeon-kim / GENIUS-CVPR25
Official Implementation of GENIUS: A Generative Framework for Universal Multimodal Search, CVPR 2025
☆55 · Aug 8, 2025 · Updated 11 months ago
sxl142 / GLoT
Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation
☆59 · Jun 21, 2023 · Updated 3 years ago
z-x-yang / DoraemonGPT
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
☆91 · Jun 19, 2026 · Updated last month
facebookresearch / genecis
Code and Models for "GeneCIS: A Benchmark for General Conditional Image Similarity"
☆61 · Jun 12, 2023 · Updated 3 years ago
pansanity666 / TransHuman
Official code for ICCV 2023 paper: "TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering".
☆67 · Jan 11, 2024 · Updated 2 years ago
aimagelab / ScanDiff
This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …
☆27 · May 13, 2026 · Updated 2 months ago
SooLab / DDCOT
[NeurIPS 2023] DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models
☆48 · Mar 18, 2024 · Updated 2 years ago
irgroup / repro_eval
A Python Interface to Reproducibility Measures of System-Oriented IR Experiments
☆11 · Dec 2, 2025 · Updated 7 months ago
yuangan / evaluation_eat
Evaluation code for "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation"
☆19 · Mar 10, 2024 · Updated 2 years ago
mzhaoshuai / RLCF
[ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.
☆102 · Oct 20, 2025 · Updated 9 months ago
mpourkeshavarz / Cost-Free-Incremental-Learning
☆23 · Aug 14, 2022 · Updated 3 years ago
ABaldrati / CLIP4CirDemo
[CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features
☆85 · Nov 12, 2024 · Updated last year
RUC-NLPIR / DeepImageSearch
☆87 · May 2, 2026 · Updated 2 months ago
Pter61 / context-i2w
Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]
☆54 · May 27, 2025 · Updated last year
suoych / KEDs
Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)
☆20 · Nov 4, 2024 · Updated last year
ZJCV / TSM
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
☆22 · Nov 4, 2020 · Updated 5 years ago
Monoxide-Chen / uncertainty_retrieval
ICLR'24 Official Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization
☆74 · Jan 30, 2024 · Updated 2 years ago
NUS-HPC-AI-Lab / Multimodal-ICL-Retriever
☆10 · Nov 12, 2024 · Updated last year
knightyxp / DGL
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
☆49 · Oct 14, 2024 · Updated last year