☆24Feb 4, 2026Updated last month
Alternatives and similar repositories for bm25-benchmarks
Users that are interested in bm25-benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python library to generate highly realistic typos (fuzz-testing)☆13Mar 16, 2025Updated last year
- Your dive log for deep work☆11Aug 16, 2023Updated 2 years ago
- ☆19May 16, 2024Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 7 months ago
- ☆10Aug 22, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Fair search elasticsearch plugin☆15Dec 9, 2022Updated 3 years ago
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- An experiment in visualizing your Solr index via term counts, document counts, and memory usage per field and data type.☆15Feb 13, 2015Updated 11 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- Code for Personalized Large Language Models via Selective Prompt Tuning☆10Jun 26, 2024Updated last year
- Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials☆14Jan 9, 2026Updated 2 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da…☆15Mar 28, 2025Updated 11 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆209Aug 31, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Oct 28, 2025Updated 4 months ago
- EPSScall☆11Jun 10, 2022Updated 3 years ago
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning☆27Jul 27, 2025Updated 7 months ago
- TAXI: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling☆29Jul 6, 2023Updated 2 years ago
- ☆11Mar 12, 2025Updated last year
- Aurora is a central design system for all products and applications for the Open, Accessible Digital Workspace. This repo is for all code…☆16Feb 23, 2024Updated 2 years ago
- codemirror extensions includes toolbar, helper, image-upload, event-emitter☆12Jan 15, 2026Updated 2 months ago
- Common Index File Format to to support interoperability between open-source IR engines☆40Sep 19, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Example application written using Reboot☆11Mar 10, 2026Updated 2 weeks ago
- ☆20Jul 24, 2024Updated last year
- Benchmarking Elasticsearch vs. Opensearch☆24Sep 15, 2025Updated 6 months ago
- Embedded facial recognition system involving PYNQ board, Webcam, and HDMI output.☆11May 10, 2018Updated 7 years ago
- ☆11Oct 14, 2020Updated 5 years ago
- Full text search that feels like a numpy array☆304Feb 1, 2026Updated last month
- Docker container to make running Luigi tasks real easy.☆11Aug 31, 2016Updated 9 years ago
- Evolving Interpretable Fuzzy Rule Based Systems with Genetic Programming for Predictive Maintenance☆12Nov 11, 2024Updated last year
- Querqy for Elasticsearch☆48Feb 2, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Search relevance evaluation toolkit☆73Jan 21, 2022Updated 4 years ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆38Aug 4, 2025Updated 7 months ago
- The non-user-of-rawdraw-facing side of rawdraw.☆12Jan 12, 2021Updated 5 years ago
- [ Text Analytics ] 법률 도메인 특화 한국어 기반 LLM 개발☆15Sep 14, 2025Updated 6 months ago
- RLIBM-ALL: A correctly rounded math library and a polynomial generator that produces correct results for multiple floating point represen…☆16Oct 6, 2023Updated 2 years ago
- SOLR bulk indexing utility for the command line.☆45Mar 5, 2026Updated 3 weeks ago
- Migration tool providing support for Apache Cassandra, DataStax Enterprise Cassandra, & DataStax Enterprise Solr.☆37Oct 22, 2019Updated 6 years ago