machinelearningZH/hybrid-search-eval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/machinelearningZH/hybrid-search-eval)

machinelearningZH / hybrid-search-eval

A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.

☆40

Alternatives and similar repositories for hybrid-search-eval

Users that are interested in hybrid-search-eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

machinelearningZH / semantic-search-eval
View on GitHub
A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.
☆40Jul 9, 2026Updated 2 weeks ago
NewBornRustacean / muvera-rs
View on GitHub
unofficial implementation of MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encodings
☆15Feb 18, 2026Updated 5 months ago
rasyosef / splade-index
View on GitHub
Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba
☆38Oct 16, 2025Updated 9 months ago
athrael-soju / little-scripts
View on GitHub
A monorepo containing various utility scripts, tools, and applications for development, automation, and AI-powered tasks.
☆16Mar 22, 2026Updated 4 months ago
entscheidsuche / NeueScraper
View on GitHub
Neue Scraper
☆11Jul 15, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AstraBert / PhiQwenSTEM
View on GitHub
A reasoning assistant for your STEM education
☆24Mar 11, 2025Updated last year
urchade / GraphER
View on GitHub
GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction
☆98Jul 31, 2024Updated last year
hseb-benchmark / hseb
View on GitHub
HSEB: Hybrid Search Engine Benchmark
☆21Oct 5, 2025Updated 9 months ago
qdrant / miniCOIL
View on GitHub
Contextualized per-token embeddings
☆37Updated this week
MinishLab / tokenlearn
View on GitHub
Pre-train Static Word Embeddings
☆109Jun 9, 2026Updated last month
lightonai / fastkmeans-rs
View on GitHub
A Rust rewrite of FastKMeans for CPU-based clustering
☆17Jun 29, 2026Updated 3 weeks ago
fdugzc / opensearch-sparse-model-tuning-sample
View on GitHub
Code of fine-tuning neural sparse models and training from scratch. #SIGIR2025
☆26Mar 11, 2026Updated 4 months ago
EricLBuehler / candle_graphs
View on GitHub
Graph model execution API for Candle
☆18Jul 27, 2025Updated 11 months ago
lgnbhl / BFS
View on GitHub
🇨🇭Search and Download Data from the Swiss Federal Statistical Office
☆25Jul 3, 2026Updated 3 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
barjacks / swiss-asylum-judges
View on GitHub
Analysing 30'000 verdicts on deportation appeals of the Swiss Federal Administrative Court
☆14Feb 20, 2017Updated 9 years ago
vespaai-playground / vespaembed
View on GitHub
No code tool for finetuning embedding models
☆30Updated this week
cognica-io / bayesian-bm25
View on GitHub
Bayesian probability transforms for BM25 retrieval scores
☆77Jun 20, 2026Updated last month
DSBA-Lab / Contrastive-Accumulation
View on GitHub
☆14Jul 7, 2024Updated 2 years ago
Marker-Inc-Korea / AutoRAG-example-korean-embedding-benchmark
View on GitHub
AutoRAG example about benchmarking Korean embeddings.
☆45Oct 2, 2024Updated last year
tezansahu / dvc-pycaret-fastapi-demo
View on GitHub
Repository for the Demo of using DVC with PyCaret & MLOps (DVC Office Hours - 20th Jan, 2022)
☆11Jan 20, 2022Updated 4 years ago
sebastianschramm / fastapi_hf_endpoints
View on GitHub
Custom fastapi server packaged as docker image for Huggingface inference endpoints deployment
☆13Apr 17, 2024Updated 2 years ago
hanxiao / searchbox
View on GitHub
Airgapped closed-corpus QA loop: a self-hosted Qwen3.6 agent explores a .zip dataroom under a token budget with local tools
☆49Jun 26, 2026Updated 3 weeks ago
suhan1433 / LLM-as-a-judge-using-G-eval
View on GitHub
LLM-as-a-judge using G-eval Scratch
☆15Oct 12, 2025Updated 9 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
MantisAI / sieves
View on GitHub
Plug-and-play document AI with zero-shot models.
☆126May 11, 2026Updated 2 months ago
moodymudskipper / pkg
View on GitHub
Package Objects
☆12Jun 5, 2025Updated last year
microsoft / post-training-toolkit
View on GitHub
☆25Jan 28, 2026Updated 5 months ago
Knowledgator / GLinker
View on GitHub
Efficient and scalable zero-shot entity linking
☆140Updated this week
lakeraai / dsec-gandalf
View on GitHub
☆24Mar 18, 2025Updated last year
jina-ai / embedding-inversion-demo
View on GitHub
Embedding Inversion via Conditional Masked Diffusion: recover original text from embedding vectors using parallel denoising. Live demo + …
☆60Mar 7, 2026Updated 4 months ago
metaodi / swissparlpy
View on GitHub
Wrapper for the Swiss Parliament API and OpenParlData.ch API for Python
☆24Mar 4, 2026Updated 4 months ago
nickaggarwal / nvidia-triton-llm-streaming
View on GitHub
Integrating SSE with NVIDIA Triton Inference Server using a Python backend and Zephyr model. There is very less documentation how to use …
☆10May 29, 2024Updated 2 years ago
ashvardanian / JaccardIndex
View on GitHub
Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables
☆22May 18, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
LawDigital / redink
View on GitHub
An alternative AI assistant for Microsoft Office that works with your favorite LLM API
☆95Updated this week
jwjohns / LFM2Sloth
View on GitHub
Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth.
☆16Sep 13, 2025Updated 10 months ago
hotchpotch / yasem
View on GitHub
YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings
☆13May 22, 2025Updated last year
viig99 / muvfde
View on GitHub
Generate fixed dimensional embeddings for multi-dimensional vectors in python based on Muvera from Google.
☆20Jun 28, 2025Updated last year
jina-ai / jzip-compressor
View on GitHub
Compression for unit-norm embedding vectors using spherical coordinates
☆83Jan 23, 2026Updated 6 months ago
mrseanryan / gpt-workflow
View on GitHub
Generate workflows (for flowcharts or low code) via LLM. Also describe workflow given in DOT.
☆19Nov 2, 2023Updated 2 years ago
recombee / CompresSAE
View on GitHub
Sparse Embedding Compression for Scalable Retrieval in Recommender Systems
☆39Nov 21, 2025Updated 8 months ago