Not a neutral survey — a field manual for engineers who build, train, and ship multimodal retrieval at production scale. The C-L-I triangle (Compression · Localization · Instruction), MLLM encoders vs late interaction, MUVERA economics, and falsifiable forecasts through 2030.
☆79Apr 20, 2026Updated last month
Alternatives and similar repositories for BeyondCLIP
Users that are interested in BeyondCLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval☆28Mar 26, 2025Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆247Nov 6, 2025Updated 7 months ago
- LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning☆78May 23, 2025Updated last year
- ☆12Nov 3, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆43Mar 2, 2026Updated 3 months ago
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆38Nov 12, 2025Updated 6 months ago
- The top conferences on video retrieval libraries in recent years, synchronized with my blog.☆14Nov 27, 2021Updated 4 years ago
- ☆10Dec 16, 2023Updated 2 years ago
- 🏆 The 1st Place Solution for AICity2022 Challenge Track2: Natural Language-Based Vehicle Retrieval.☆12Jul 25, 2022Updated 3 years ago
- ☆14Aug 15, 2025Updated 9 months ago
- 使用Qwen3的Embedding和Reranker模型实现查找与精排☆23Jun 22, 2025Updated 11 months ago
- A Fine-grained Benchmark for Video Captioning and Retrieval☆30Jul 16, 2025Updated 10 months ago
- yolov8在hisi3536a推理☆11Dec 15, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Detail-Oriented CLIP for Fine-Grained Tasks (ICLR SSI-FM 2025)☆58Mar 26, 2025Updated last year
- ☆23Jul 23, 2025Updated 10 months ago
- [CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆75Aug 31, 2025Updated 9 months ago
- Human Co-Parsing Guided Alignment for Occluded Person Re-identification(IEEE T-IP 23)☆14Aug 30, 2024Updated last year
- Python reuse of ViBe Source C code based on Cython. ViBe: A universal background subtraction algorithm for video sequences☆10Nov 19, 2020Updated 5 years ago
- A repository of all code and resources of my published blog articles.☆36Dec 21, 2025Updated 5 months ago
- Composed Video Retrieval☆62May 2, 2024Updated 2 years ago
- ☆21Mar 5, 2025Updated last year
- Normalizing Flows with Multi-Scale Autoregressive Priors (CVPR 2020)☆16Jul 22, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- QAT implementation on YOLOv8☆22Jan 28, 2024Updated 2 years ago
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆354Nov 6, 2025Updated 7 months ago
- GAN-Based Text Generation☆14Apr 19, 2020Updated 6 years ago
- ☆27Dec 3, 2021Updated 4 years ago
- C++ client of a GAN model hosted by TensorFlow Serving☆11Jul 31, 2018Updated 7 years ago
- A car re-identification app based on multi-feature fusion technique☆18Apr 24, 2022Updated 4 years ago
- ☆49Oct 17, 2025Updated 7 months ago
- Deep Learning Part 2, 2019 edition - transcriptions, screenshots and notebooks☆11Jul 19, 2019Updated 6 years ago
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆45Oct 15, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆18Sep 11, 2024Updated last year
- [ECCV2022] PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification☆71Jul 8, 2022Updated 3 years ago
- ☆12Jul 17, 2024Updated last year
- The inference of DINOv2 ONNX models using the ONNXRuntime library.☆21Apr 24, 2025Updated last year
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆87Aug 6, 2025Updated 10 months ago
- ☆10Aug 5, 2019Updated 6 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago