ZoengHN/Embed-RL

☆44

Alternatives and similar repositories for Embed-RL

Users who are interested in Embed-RL are comparing it to the repositories listed below.

XMUDeepLIT / UME-R1
The code implementation for UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings (ICLR 2026).
☆69 · Feb 25, 2026 · Updated 4 months ago
VoyageWang / VG-Refiner
The repository of VG-Refiner paper
☆20 · Dec 9, 2025 · Updated 7 months ago
VoyageWang / IteRPrimE
The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…
☆20 · Apr 6, 2025 · Updated last year
MCG-NJU / RGE
Reasoning Guided Embeddings: Leveraging MLLM Reasoning for Improved Multimodal Retrieval
☆15 · Nov 29, 2025 · Updated 7 months ago
WeChatCV / ObjEmbed
(ICML 2026) Official repository of paper "ObjEmbed: Towards Universal Multimodal Object Embeddings"
☆51 · May 18, 2026 · Updated 2 months ago
Roytsai27 / GIRCSE
Official implementation of ICLR 2026: Let LLMs Speak Embedding Languages: Generative Text Embeddings via Iterative Contrastive Refinement
☆15 · May 24, 2026 · Updated last month
GaryGuTC / UniME-v2
[AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning"
☆74 · Dec 8, 2025 · Updated 7 months ago
yunzeliu / awesome-unified-embedding
A curated list of papers, models, datasets, and benchmarks for unified multi-modal embedding models.
☆43 · Apr 29, 2026 · Updated 2 months ago
haoyu-bu / CAFe
Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"
☆33 · Mar 26, 2025 · Updated last year
ChoS3nE11ven / Agentic-MME
☆36 · Apr 13, 2026 · Updated 3 months ago
XMUDeepLIT / LLaVE
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
☆78 · May 23, 2025 · Updated last year
Yxxxb / LAVT-RS
[CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation
☆26 · Jan 21, 2025 · Updated last year
haoxiangzhao12138 / PLUME
[ACMMM 2026] PLUME: Latent Reasoning Based Universal Multimodal Embedding
☆24 · Apr 29, 2026 · Updated 2 months ago
yongliu20 / SCAN
[CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"
☆77 · Sep 23, 2024 · Updated last year
Henglin-Liu / ArtQuant
[AAAI26] Bridging Cognitive Gap: Hierarchical Description Learning for Artistic Image Aesthetics Assessment
☆22 · Dec 30, 2025 · Updated 6 months ago
Koreyoshi01 / VISD
This repository is the official implementation for VISD.
☆21 · May 17, 2026 · Updated 2 months ago
RemRico / Recall
A composed retrieval project
☆17 · Apr 9, 2026 · Updated 3 months ago
RammusLeo / ScoreHOI
Official repository of ScoreHOI (ICCV 2025)
☆16 · Dec 21, 2025 · Updated 7 months ago
zhang9302002 / ThinkingWithVideos
The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"
☆102 · Oct 15, 2025 · Updated 9 months ago
shiyi-zh0408 / NAE_CVPR2024
[CVPR 2024] Narrative Action Evaluation with Prompt-Guided Multimodal Interaction
☆43 · May 16, 2024 · Updated 2 years ago
RUCAIBox / LMM-Searcher
The official code of "Towards Long-horizon Agentic Multimodal Search"
☆27 · Apr 17, 2026 · Updated 3 months ago
saibr / hypvl
This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…
☆21 · Jul 5, 2024 · Updated 2 years ago
RammusLeo / DPMesh
The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery"
☆25 · Jul 25, 2024 · Updated last year
DeepExperience / HyperEyes
HyperEyes is a parallel multimodal search agent that fuses visual grounding and retrieval into a single atomic action, enabling concurren…
☆70 · May 23, 2026 · Updated last month
EvolvingLMMs-Lab / LLaVA-OneVision-1.5-RL
Fully Open Framework for Democratized Multimodal Reinforcement Learning.
☆51 · Dec 19, 2025 · Updated 7 months ago
MCG-NJU / VideoEval
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
☆15 · Jul 31, 2025 · Updated 11 months ago
haon-chen / mmE5
☆59 · Feb 27, 2025 · Updated last year
chendy25 / V-Retrver
☆36 · May 27, 2026 · Updated last month
VisionChengzhuo / CoF-T2I
Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.
☆39 · Jan 16, 2026 · Updated 6 months ago
deepglint / DanQing
The official repo for the DanQing dataset.
☆36 · Mar 25, 2026 · Updated 3 months ago
DaoD / DCL
From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking
☆14 · Oct 25, 2022 · Updated 3 years ago
deepglint / UniME
[ACM MM 2025] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"
☆105 · Dec 8, 2025 · Updated 7 months ago
AMAP-ML / UniVG-R1
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning
☆165 · Jun 2, 2025 · Updated last year
guangmingjian / MVANA
☆11 · May 10, 2024 · Updated 2 years ago
MYMY-young / DelimScaling
[ICLR 2026] Official implementation of "Enhancing Multi-Image Understanding Through Delimiter Token Scaling"
☆15 · Jul 10, 2026 · Updated last week
QwenLM / Qwen3-VL-Embedding
☆1,335 · Jun 23, 2026 · Updated 3 weeks ago
VincentLeebang / lvr
Official codebase for the paper Latent Visual Reasoning
☆170 · Oct 22, 2025 · Updated 8 months ago
BIGBALLON / BeyondCLIP
Not a neutral survey — a field manual for engineers who build, train, and ship multimodal retrieval at production scale. The C-L-I triang…
☆79 · Apr 20, 2026 · Updated 3 months ago
lmb-freiburg / two-effects-one-trigger
Official code for the paper "Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-…
☆24 · May 11, 2025 · Updated last year