Toward Universal Multimodal Embedding
☆77Aug 1, 2025Updated 8 months ago
Alternatives and similar repositories for UME-Search
Users that are interested in UME-Search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval☆28Mar 26, 2025Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆244Nov 6, 2025Updated 5 months ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 4 months ago
- ☆34Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning☆77May 23, 2025Updated 10 months ago
- CCF大数据竞赛--垃圾短信基于文本内容的识别☆11Mar 13, 2016Updated 10 years ago
- ☆12Nov 3, 2023Updated 2 years ago
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆43Mar 2, 2026Updated last month
- 使用yolov8自动标注,运用度量学习metric learning 的ReID算法,实现跨镜头人脸追踪☆10May 15, 2024Updated last year
- AU-Expression Knowledge Constrained Representation Learning for Facial Expression Recognition (ICRA 2021)☆11Dec 29, 2023Updated 2 years ago
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆38Nov 12, 2025Updated 5 months ago
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆36Jun 10, 2025Updated 10 months ago
- ☆10Dec 16, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repo would give multi-task keypoint detect code based yolov8. The landmarks or keypoints with different classes and numbers can be …☆12Feb 28, 2023Updated 3 years ago
- 🏆 The 1st Place Solution for AICity2022 Challenge Track2: Natural Language-Based Vehicle Retrieval.☆12Jul 25, 2022Updated 3 years ago
- ☆13Aug 15, 2025Updated 7 months ago
- 使用Qwen3的Embedding和Reranker模型实现查找与精排☆21Jun 22, 2025Updated 9 months ago
- A collection of awesome think with videos papers.☆97Dec 1, 2025Updated 4 months ago
- ☆17Oct 16, 2023Updated 2 years ago
- Detail-Oriented CLIP for Fine-Grained Tasks (ICLR SSI-FM 2025)☆57Mar 26, 2025Updated last year
- [CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆67Aug 31, 2025Updated 7 months ago
- Human Co-Parsing Guided Alignment for Occluded Person Re-identification(IEEE T-IP 23)☆14Aug 30, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆23Jan 16, 2018Updated 8 years ago
- [ACM MM 2025] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"☆105Dec 8, 2025Updated 4 months ago
- Python reuse of ViBe Source C code based on Cython. ViBe: A universal background subtraction algorithm for video sequences☆10Nov 19, 2020Updated 5 years ago
- Tamura Texture implemented by python☆15Feb 27, 2019Updated 7 years ago
- A repository of all code and resources of my published blog articles.☆36Dec 21, 2025Updated 3 months ago
- Composed Video Retrieval☆62May 2, 2024Updated last year
- Joint angle comparison of mediapipe prediction results bvh conversion with ground truth bvh☆11Apr 1, 2023Updated 3 years ago
- ☆20Mar 5, 2025Updated last year
- A simple Computer Vision Framework, mainly based on PyTorch. Including distributed training, logging and so on.☆12Dec 2, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A keras version of real-time fire detection network: mobilenet_v2_ssdlite.☆17Dec 8, 2022Updated 3 years ago
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆348Nov 6, 2025Updated 5 months ago
- yolov8obb 旋转目标检测部署rknn的C++代码☆20Jul 16, 2024Updated last year
- ☆48Oct 17, 2025Updated 5 months ago
- Implementation of Variational Auto-Encoder for text generation in pytorch.☆12Oct 9, 2020Updated 5 years ago
- A car re-identification app based on multi-feature fusion technique☆18Apr 24, 2022Updated 3 years ago
- [TIP] Exploring Effective Factors for Improving Visual In-Context Learning☆20Jul 2, 2025Updated 9 months ago