360CVGroup/RzenEmbed

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/360CVGroup/RzenEmbed)

360CVGroup / RzenEmbed

Embedding model prioritized towards Multimodal RAG, overall + VisDoc double top1 on MMEB benchmark

☆36

Alternatives and similar repositories for RzenEmbed

Users that are interested in RzenEmbed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

raghavlite / B3
View on GitHub
☆43Jan 12, 2026Updated 6 months ago
XMUDeepLIT / UME-R1
View on GitHub
The code implementation for UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings (ICLR 2026).
☆69Feb 25, 2026Updated 4 months ago
Tencent-QQMM / QQMM-embed
View on GitHub
☆25Jun 22, 2026Updated 3 weeks ago
CLUEbenchmark / Math24o
View on GitHub
Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark
☆14Mar 27, 2025Updated last year
GaryGuTC / UniME-v2
View on GitHub
[AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning"
☆74Dec 8, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
haon-chen / MoCa
View on GitHub
☆68Aug 14, 2025Updated 11 months ago
360CVGroup / Inner-Adaptor-Architecture
View on GitHub
LMM solved catastrophic forgetting, AAAI2025
☆45Apr 15, 2025Updated last year
TIGER-AI-Lab / VLM2Vec
View on GitHub
This repo contains the code for "VLM2Vec / MMEB" [ICLR 2025], "VLM2Vec-V2 / MMEB-V2" [TMLR 2026], and "MMEB-V3" [COLM 2026]
☆667Jun 24, 2026Updated 3 weeks ago
jina-ai / jina-vdr
View on GitHub
Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval
☆38Aug 4, 2025Updated 11 months ago
Episoode / Double-Bench
View on GitHub
[AAAI-26] Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?
☆31Dec 14, 2025Updated 7 months ago
360CVGroup / 360VL
View on GitHub
Our 2nd-gen LMM
☆34May 22, 2024Updated 2 years ago
96-Zachary / vse_2ad
View on GitHub
☆15Apr 30, 2022Updated 4 years ago
pspdada / SENTINEL
View on GitHub
[ICCV 2025] Official repository of "Mitigating Object Hallucinations via Sentence-Level Early Intervention".
☆31Jul 2, 2026Updated 2 weeks ago
google / humanio
View on GitHub
Human I/O, published at CHI 2024, Honorable Mentions Award
☆18May 21, 2026Updated last month
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
pritamqu / HALVA
View on GitHub
[ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination
☆21Jan 27, 2025Updated last year
ag2ai / SimpleDoc
View on GitHub
☆41Jan 9, 2026Updated 6 months ago
XMUDeepLIT / LLaVE
View on GitHub
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
☆78May 23, 2025Updated last year
QwenLM / Qwen3-VL-Embedding
View on GitHub
☆1,333Jun 23, 2026Updated 3 weeks ago
varshakishore / IncDSI
View on GitHub
☆11Sep 10, 2023Updated 2 years ago
zt706 / tensorflow-mtcnn
View on GitHub
tensorflow mtcnn
☆24Feb 20, 2017Updated 9 years ago
DCDmllm / Momentor
View on GitHub
☆81Nov 24, 2024Updated last year
irfl-dataset / IRFL
View on GitHub
IRFL: Image Recognition of Figurative Language
☆12Nov 30, 2023Updated 2 years ago
TwoBranchDracaena / OpenFace-PyTorch
View on GitHub
PyTorch model of OpenFace
☆12May 8, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
chaxjli / U-MARVEL
View on GitHub
☆36Mar 24, 2026Updated 3 months ago
abeardear / ncnn-yolo
View on GitHub
convert pytorch trained yolo model to ncnn for Flexible deployment
☆10Aug 30, 2018Updated 7 years ago
ant-research / M2-Miner
View on GitHub
[ICLR 2026] M2-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining
☆55Apr 22, 2026Updated 2 months ago
360CVGroup / LMM-Det
View on GitHub
Make Large Multimodal Models excel in object detection, ICCV 2025
☆65Aug 1, 2025Updated 11 months ago
LgQu / TIGeR
View on GitHub
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16Updated this week
shanface33 / GPT4MF_UB
View on GitHub
Official repository of the paper: Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics
☆15Mar 22, 2024Updated 2 years ago
Mungeryang / colqwen3
View on GitHub
The code used to train and run inference with the ColQwen3 model. Welcome to follow and star! ⭐️⭐️⭐️ https://huggingface.co/goodman2001/…
☆15Jul 4, 2026Updated 2 weeks ago
Code-kunkun / LamRA
View on GitHub
[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
☆182Jul 7, 2025Updated last year
panruotong / CAG
View on GitHub
Implementation of Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation. Paper: https://arxiv.org/abs/2404.06809
☆22Oct 22, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
longmalongma / TW-GRPO
View on GitHub
The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"
☆36Jun 12, 2025Updated last year
mlvlab / VidChain
View on GitHub
Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…
☆25Jan 26, 2025Updated last year
Show-han / Zeroshot_REC
View on GitHub
Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)
☆28Jun 21, 2024Updated 2 years ago
musicman217 / Text-Proxy
View on GitHub
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025
☆21May 8, 2026Updated 2 months ago
NEUIR / M2RAG
View on GitHub
[MM '25] This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".
☆44Sep 27, 2025Updated 9 months ago
plm-team / PLM
View on GitHub
PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing
☆21Mar 18, 2025Updated last year
ZackAkil / gmail-event-genie
View on GitHub
Workspace Add-on that uses Gemini to extract out the important dates from an email and quickly add them to your calendar.
☆18Nov 6, 2024Updated last year