A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.
☆38Apr 8, 2026Updated this week
Alternatives and similar repositories for hybrid-search-eval
Users that are interested in hybrid-search-eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 7 months ago
- Code of fine-tuning neural sparse models and training from scratch. #SIGIR2025☆24Mar 11, 2026Updated last month
- ☆14Jul 7, 2024Updated last year
- Structured output benchmarks comparing DSPy and BAML with different LLMs☆28Dec 23, 2025Updated 3 months ago
- Performs benchmarking on two Korean datasets with minimal time and effort.☆46Jan 22, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- AutoRAG example about benchmarking Korean embeddings.☆44Oct 2, 2024Updated last year
- An extensive and commented list of resources on Learned Sparse Retrieval.☆49Mar 29, 2026Updated 2 weeks ago
- 데이터와 모델로 채우는 모두를 위한 AI 허브 가든☆35Jul 4, 2025Updated 9 months ago
- Efficient Finetuning for OpenAI GPT-OSS☆23Oct 2, 2025Updated 6 months ago
- A Python library for creating adversarial splits☆14Jul 24, 2022Updated 3 years ago
- 🔍 Enable AI assistants to search and access ClinicalTrials.gov data through a simple MCP interface.☆15Apr 9, 2025Updated last year
- Marketing Attribution Data Model. SQL, Clickhouse, BigQuery☆23Apr 6, 2024Updated 2 years ago
- ☆11Apr 19, 2021Updated 4 years ago
- Generate workflows (for flowcharts or low code) via LLM. Also describe workflow given in DOT.☆18Nov 2, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- A Scalable and Consistent Distributed Cache☆15Feb 12, 2024Updated 2 years ago
- A curated list of reranking models, libraries, and resources for building high-quality Retrieval-Augmented Generation (RAG) applications.☆50Jan 20, 2026Updated 2 months ago
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning☆27Jul 27, 2025Updated 8 months ago
- C++17 implementation of einops for libtorch - clear and reliable tensor manipulations with einstein-like notation☆11Oct 16, 2023Updated 2 years ago
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset)☆20May 18, 2024Updated last year
- 🔎 A Prodigy plugin for evaluating spaCy pipelines☆13Mar 26, 2024Updated 2 years ago
- Better Live Text for MacOS☆35Feb 8, 2026Updated 2 months ago
- A simple library for adding noise to data.☆12Apr 25, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ⚡ Super fast clustering for high-dimensional vectors on CPUs (x86, ARM) and GPUs — for Python and C++. 100x faster clustering of vector e…☆58Updated this week
- ☆20Apr 8, 2025Updated last year
- An update to the network of characters in Victor Hugo's Les Miserables first encoded by Donald Knuth, as part of the Stanford Graph Base …☆15Oct 27, 2015Updated 10 years ago
- Yet Another Matplotlib Extension☆15Dec 1, 2021Updated 4 years ago
- StrategyQA 데이터 세트 번역☆22Apr 12, 2024Updated 2 years ago
- Nord color scheme for BBEdit☆12May 9, 2022Updated 3 years ago
- A reasoning assistant for your STEM education☆24Mar 11, 2025Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Apr 5, 2022Updated 4 years ago
- AgentBudget is the ulimit for AI agents. Just like Unix systems have ulimit to prevent a single process from consuming all system resourc…☆98Apr 5, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Deep learning models for contextual multi-armed bandit setting☆13May 16, 2021Updated 4 years ago
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)☆16Sep 29, 2024Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- It shows how to deploy and use an agent with LLM.☆19Mar 1, 2025Updated last year
- Curated list of tools, skills, plugins, and MCP servers for Claude Code☆69Mar 20, 2026Updated 3 weeks ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 5 months ago
- ☆22May 2, 2025Updated 11 months ago