QQ-MM/QQMM-embed

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/QQ-MM/QQMM-embed)

QQ-MM / QQMM-embed

☆23

Alternatives and similar repositories for QQMM-embed

Users that are interested in QQMM-embed are comparing it to the libraries listed below

Sorting:

raghavlite / B3
View on GitHub
☆37Jan 12, 2026Updated last month
deepglint / UniME
View on GitHub
[ACM MM 2025] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"
☆103Dec 8, 2025Updated 2 months ago
XMUDeepLIT / LLaVE
View on GitHub
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
☆77May 23, 2025Updated 9 months ago
tripletclip / TripletCLIP
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"
☆46Dec 1, 2024Updated last year
haon-chen / MoCa
View on GitHub
☆67Aug 14, 2025Updated 6 months ago
TIGER-AI-Lab / UniIR
View on GitHub
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)
☆178Oct 1, 2024Updated last year
TIGER-AI-Lab / VLM2Vec
View on GitHub
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]
☆579Feb 11, 2026Updated 2 weeks ago
i2vec / MM-R5
View on GitHub
The official repository of MM-R5
☆28Jun 22, 2025Updated 8 months ago
xiaoxing2001 / DeGLA
View on GitHub
[ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]
☆15Jul 15, 2025Updated 7 months ago
czy341181 / monoconX
View on GitHub
☆10Apr 8, 2022Updated 3 years ago
racinmat / GTAVisionExport-postprocessing
View on GitHub
☆11Jan 27, 2020Updated 6 years ago
NatLee / simply-blur-detector
View on GitHub
A simply deep learning based blur image detector.
☆10Mar 29, 2023Updated 2 years ago
leolee99 / Online-CNCLIP
View on GitHub
ChineseCLIP using online learning
☆13Nov 7, 2022Updated 3 years ago
typesense / typesense-vue-instantsearch-demo
View on GitHub
A demo app that shows you how to use Vue & the Typesense InstantSearch adapter, to build rich search interfaces.
☆11Jan 23, 2024Updated 2 years ago
caipeng328 / ForCenNet
View on GitHub
☆75Jul 31, 2025Updated 7 months ago
VectorSpaceLab / MegaPairs
View on GitHub
[ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval
☆243Nov 6, 2025Updated 3 months ago
justliulong / OGHFYOLO
View on GitHub
The official code for "OG-HFYOLO :Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…
☆13Jul 28, 2025Updated 7 months ago
florinshen / Rope3D-Toolkit
View on GitHub
Third-party toolkit for Rope3D dataset
☆13Jun 13, 2022Updated 3 years ago
aicourse / zmc201-greedy-cv-2019
View on GitHub
☆11Sep 25, 2019Updated 6 years ago
plantsgo / ECAA-AI-
View on GitHub
ECAA首届电子商务AI算法大赛
☆11Aug 31, 2021Updated 4 years ago
dreams-flying / nlp-CGED
View on GitHub
Chinese Grammatical Error Diagnosis
☆12Oct 26, 2021Updated 4 years ago
XuweiyiChen / Pix2Gif
View on GitHub
☆13Mar 8, 2024Updated last year
loveisp / KDD_2024_AQA
View on GitHub
KDD 2024 AQA competition 2nd place solution
☆12Jul 21, 2024Updated last year
bardisafa / PreSel
View on GitHub
[CVPR 2025] An Implementation of the paper "Pre-Instruction Data Selection for Visual Instruction Tuning"
☆17Jun 9, 2025Updated 8 months ago
qijimrc / mm_evaluation
View on GitHub
☆11Aug 4, 2024Updated last year
Rbn3D / GTAV_NativeDB_Explorer
View on GitHub
A fully featured desktop application where you can download and browse GTA V Native functions from NativeDB (for script developers)
☆14Jan 2, 2018Updated 8 years ago
mmadry / st-hmp
View on GitHub
Implementation of the Spatio-Temporal Hierarchical Matching Pursuit (ST-HMP) descriptor presented in the paper: M. Madry, L. Bo, D. Kragi…
☆14Aug 4, 2014Updated 11 years ago
DSE-MSU / Recommender-System-Datasets
View on GitHub
A list of compatible datasets, noting other major repositories containing popular real-world datasets, along with sample code for a range…
☆12Mar 18, 2020Updated 5 years ago
kanhaoning / RAG-Optimization-Practices
View on GitHub
☆88Jul 30, 2025Updated 7 months ago
injadlu / DAMA
View on GitHub
[ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"
☆16May 24, 2025Updated 9 months ago
Jmipar-k / lora-dinov2
View on GitHub
A repo for fine-tuning DINOv2 using LoRA layers for multi-class classification
☆17Nov 14, 2024Updated last year
weimengmeng1999 / AdapterSIS
View on GitHub
Enhancing Surgical Instrument Segmentation: Integrating Vision Transformer Insights with Adapter
☆12Mar 21, 2024Updated last year
TenMilesLotus / DTSM
View on GitHub
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆12Apr 28, 2024Updated last year
CharlesNeilWilliams / TME
View on GitHub
[CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".
☆22Jun 9, 2025Updated 8 months ago
brielle751992346 / gesture-recognition
View on GitHub
基于Win10 + Python3.7环境，从采集手势库开始，提取手势轮廓线，提取轮廓线的傅里叶算子作为特征，用KNN和SVM作为分类器训练模型，并用PyQt制作简易桌面
☆10Aug 6, 2019Updated 6 years ago
zhongpeixiang / affect-rich-conversational-model
View on GitHub
The PyTorch code for paper: An Affect-Rich Neural Conversational Model with Biased Attention and Weighted Cross-Entropy Loss
☆12Oct 7, 2019Updated 6 years ago
chs20 / fuselip
View on GitHub
FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens
☆17Sep 8, 2025Updated 5 months ago
zhengye1995 / Autonomous-driving-traffic-sign-recognition-based-on-virtual-simulation-environment-Rank3
View on GitHub
基于虚拟仿真环境下的自动驾驶交通标志识别第三名方案
☆10Jan 11, 2020Updated 6 years ago
cXPromise / 2021ECAA_Top2_Solution
View on GitHub
首届电子商务AI算法大赛TOP2开源代码
☆13Aug 31, 2021Updated 4 years ago