TIGER-AI-Lab / VLM2Vec

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25]
142Updated last week

Alternatives and similar repositories for VLM2Vec:

Users that are interested in VLM2Vec are comparing it to the libraries listed below