BIGBALLON/UME-Search

Toward Universal Multimodal Embedding

☆77

Alternatives and similar repositories for UME-Search

Users who are interested in UME-Search are comparing it to the repositories listed below.

hmchuong / CoLLM
[CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval
☆28 · Mar 26, 2025 · Updated last year
haoyu-bu / CAFe
Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"
☆33 · Mar 26, 2025 · Updated last year
VectorSpaceLab / MegaPairs
[ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval
☆244 · Nov 6, 2025 · Updated 5 months ago
VoyageWang / VG-Refiner
The repository of VG-Refiner paper
☆17 · Dec 9, 2025 · Updated 4 months ago
ZoengHN / Embed-RL
☆34 · Updated this week
XMUDeepLIT / LLaVE
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
☆77 · May 23, 2025 · Updated 10 months ago
shuangge-jb / sklearn
CCF Big Data Competition: text-content-based spam SMS recognition
☆11 · Mar 13, 2016 · Updated 10 years ago
tju-chengyijia / RDNet
☆12 · Nov 3, 2023 · Updated 2 years ago
V-STaR-Bench / V-STaR
Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning
☆43 · Mar 2, 2026 · Updated last month
Caesar-xxx / Human_ReID
Automatic annotation with yolov8 plus a metric-learning ReID algorithm for cross-camera face tracking
☆10 · May 15, 2024 · Updated last year
HCPLab-SYSU / AUE-CRL
AU-Expression Knowledge Constrained Representation Learning for Facial Expression Recognition (ICRA 2021)
☆11 · Dec 29, 2023 · Updated 2 years ago
Timsty1 / FineCLIP
FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)
☆38 · Nov 12, 2025 · Updated 5 months ago
yangjie-cv / WeThink
WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning
☆36 · Jun 10, 2025 · Updated 10 months ago
zyj20 / MPReID
☆10 · Dec 16, 2023 · Updated 2 years ago
FlyleafPaopao / yolov8-multi-keypoint
This repo would give multi-task keypoint detect code based yolov8. The landmarks or keypoints with different classes and numbers can be …
☆12 · Feb 28, 2023 · Updated 3 years ago
hbchen121 / AICITY2022_Track2_SSM
🏆 The 1st Place Solution for AICity2022 Challenge Track2: Natural Language-Based Vehicle Retrieval.
☆12 · Jul 25, 2022 · Updated 3 years ago
BinWangGzhu / SVLL-ReID
☆13 · Aug 15, 2025 · Updated 7 months ago
SanroZhang / qwen3-ER-search
Retrieval and fine-grained reranking using Qwen3's Embedding and Reranker models
☆21 · Jun 22, 2025 · Updated 9 months ago
zjuruizhechen / Awesome-Video-Agent
A collection of awesome think with videos papers.
☆97 · Dec 1, 2025 · Updated 4 months ago
RuningMangoPi / yolov8_QAT
☆17 · Oct 16, 2023 · Updated 2 years ago
KishoreP1 / DetailCLIP
Detail-Oriented CLIP for Fine-Grained Tasks (ICLR SSI-FM 2025)
☆57 · Mar 26, 2025 · Updated last year
seilk / LocalizationHeads
[CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
☆67 · Aug 31, 2025 · Updated 7 months ago
Vill-Lab / 2022-TIP-HCGA
Human Co-Parsing Guided Alignment for Occluded Person Re-identification (IEEE T-IP 23)
☆14 · Aug 30, 2024 · Updated last year
ca-joe-yang / ColorHarmonization
☆23 · Jan 16, 2018 · Updated 8 years ago
deepglint / UniME
[ACM MM 2025] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"
☆105 · Dec 8, 2025 · Updated 4 months ago
232525 / ViBe.Cython
Python reuse of ViBe Source C code based on Cython. ViBe: A universal background subtraction algorithm for video sequences
☆10 · Nov 19, 2020 · Updated 5 years ago
MarshalLeeeeee / Tamura-In-Python
Tamura Texture implemented by python
☆15 · Feb 27, 2019 · Updated 7 years ago
anand-subu / blog_resources
A repository of all code and resources of my published blog articles.
☆36 · Dec 21, 2025 · Updated 3 months ago
OmkarThawakar / composed-video-retrieval
Composed Video Retrieval
☆62 · May 2, 2024 · Updated last year
CocoaPebble / mediapipe_pose_compare
Joint angle comparison of mediapipe prediction results bvh conversion with ground truth bvh
☆11 · Apr 1, 2023 · Updated 3 years ago
PKU-ICST-MIPL / MAI_ICLR2025
☆20 · Mar 5, 2025 · Updated last year
HELLORPG / CV-Framework
A simple Computer Vision Framework, mainly based on PyTorch. Including distributed training, logging and so on.
☆12 · Dec 2, 2023 · Updated 2 years ago
zhangw864680355 / fire_detect_mobilenet_v2_ssdlite_keras
A keras version of real-time fire detection network: mobilenet_v2_ssdlite.
☆17 · Dec 8, 2022 · Updated 3 years ago
Charles-Xie / awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…
☆348 · Nov 6, 2025 · Updated 5 months ago
cqu20160901 / yolov8obb_rknn_Cplusplus
C++ code for deploying yolov8obb rotated object detection on rknn
☆20 · Jul 16, 2024 · Updated last year
ylm1379239710 / YOLOv8-PySide6-CLIP-REID
☆48 · Oct 17, 2025 · Updated 5 months ago
jiangqn / TextVAE-pytorch
Implementation of Variational Auto-Encoder for text generation in pytorch.
☆12 · Oct 9, 2020 · Updated 5 years ago
hanlanqian / Car_ReIdentification_application
A car re-identification app based on multi-feature fusion technique
☆18 · Apr 24, 2022 · Updated 3 years ago
syp2ysy / prompt-SelF
[TIP] Exploring Effective Factors for Improving Visual In-Context Learning
☆20 · Jul 2, 2025 · Updated 9 months ago