Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval
☆38Aug 4, 2025Updated 8 months ago
Alternatives and similar repositories for jina-vdr
Users that are interested in jina-vdr are comparing it to the libraries listed below.
- Training code for Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 5 months ago
- Math24o: High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated last year
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"☆15Aug 26, 2025Updated 7 months ago
- ☆64Feb 6, 2026Updated 2 months ago
- Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models"☆22Aug 14, 2025Updated 8 months ago
- 🎹 Instruct.KR 2025 Summer Meetup: Open-Source LLMs, All the Way to Production with vLLM 🎹☆23Aug 2, 2025Updated 8 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- [WWW24-UrbanCLIP] A comprehensive toolkit designed to facilitate the collection, processing, and integration of satellite imagery and ass…☆18Oct 6, 2024Updated last year
- Query Expansion for Better Query Embedding using LLMs☆69Feb 18, 2025Updated last year
- The code used to train and run inference with MMDocIR☆33May 29, 2025Updated 10 months ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 7 months ago
- Code for Personalized Large Language Models via Selective Prompt Tuning☆10Jun 26, 2024Updated last year
- Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials☆16Jan 9, 2026Updated 3 months ago
- Chain of Images for Intuitively Reasoning☆10Nov 29, 2023Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- ☆20Jun 12, 2024Updated last year
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- [CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding☆36Jul 22, 2025Updated 8 months ago
- Clober Solidity Library☆10Jun 9, 2025Updated 10 months ago
- ☆14Jul 7, 2024Updated last year
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- ☆10Dec 6, 2022Updated 3 years ago
- Testing flow matching in Euclidean space and Lie groups.☆13Mar 18, 2026Updated last month
- ☆38Jan 9, 2026Updated 3 months ago
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da…☆15Mar 28, 2025Updated last year
- ☆16Sep 22, 2024Updated last year
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- Code for AAAI 2021 long paper Learning from Crowds by Modeling Common Confusions.☆11Feb 6, 2021Updated 5 years ago
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning☆27Jul 27, 2025Updated 8 months ago
- ☆16Oct 3, 2022Updated 3 years ago
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.☆14Jan 27, 2017Updated 9 years ago
- ☆24Feb 4, 2026Updated 2 months ago
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning☆74Mar 26, 2026Updated 3 weeks ago
- Submodular optimization for context engineering: query fan-out, text selection, passage reranking☆80Jul 14, 2025Updated 9 months ago
- ☆60Jan 26, 2025Updated last year
- Official Code Repository for the paper "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024).☆16Jul 21, 2024Updated last year
- Pycon KR 2023 presentation☆13Feb 7, 2024Updated 2 years ago
- ☆11Jun 14, 2024Updated last year