PrithivirajDamodaran/FlashRank

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PrithivirajDamodaran/FlashRank)

PrithivirajDamodaran / FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

☆997

Alternatives and similar repositories for FlashRank

Users that are interested in FlashRank are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AnswerDotAI / rerankers
View on GitHub
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,625Dec 20, 2025Updated 7 months ago
PrithivirajDamodaran / SPLADERunner
View on GitHub
Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…
☆35Aug 24, 2024Updated last year
AnswerDotAI / RAGatouille
View on GitHub
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,944May 17, 2025Updated last year
PrithivirajDamodaran / Route0x
View on GitHub
Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da
☆122Mar 31, 2025Updated last year
michaelfeil / infinity
View on GitHub
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
☆2,899Mar 24, 2026Updated 4 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
xhluca / bm25s
View on GitHub
Fast BM25 search in Python, powered by Numpy and Numba
☆1,751Jul 22, 2026Updated last week
lightonai / pylate
View on GitHub
Late Interaction Models Training & Retrieval
☆876Updated this week
qdrant / fastembed
View on GitHub
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
☆3,113Jul 22, 2026Updated last week
castorini / rank_llm
View on GitHub
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
☆611Jul 19, 2026Updated last week
stanford-futuredata / ColBERT
View on GitHub
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
☆3,904Oct 14, 2025Updated 9 months ago
FlagOpen / FlagEmbedding
View on GitHub
Retrieval and Retrieval-augmented LLMs
☆11,997Apr 22, 2026Updated 3 months ago
naver / splade
View on GitHub
SPLADE: sparse neural search (SIGIR21, SIGIR22)
☆999May 3, 2024Updated 2 years ago
PrithivirajDamodaran / blitz-embed
View on GitHub
C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…
☆24Mar 4, 2024Updated 2 years ago
urchade / GLiNER
View on GitHub
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts)
☆3,462Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
dottxt-ai / outlines
View on GitHub
Structured Outputs
☆15,419Updated this week
MinishLab / model2vec
View on GitHub
Fast State-of-the-Art Static Embeddings
☆2,167Jun 6, 2026Updated last month
Unstructured-IO / unstructured
View on GitHub
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…
☆15,210Updated this week
stanfordnlp / dspy
View on GitHub
DSPy: The framework for programming—not prompting—language models
☆36,460Updated this week
vibrantlabsai / ragas
View on GitHub
Supercharge Your LLM Application Evaluations 🚀
☆15,031Feb 24, 2026Updated 5 months ago
huggingface / text-embeddings-inference
View on GitHub
A blazing fast inference solution for text embeddings models
☆4,964Updated this week
huggingface / setfit
View on GitHub
Efficient few-shot learning with Sentence Transformers
☆2,777May 26, 2026Updated 2 months ago
illuin-tech / colpali
View on GitHub
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,711Jul 13, 2026Updated 2 weeks ago
mixedbread-ai / baguetter
View on GitHub
Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…
☆211Aug 31, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mixedbread-ai / batched
View on GitHub
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆161Jul 14, 2025Updated last year
sunnweiwei / RankGPT
View on GitHub
Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]
☆669Mar 10, 2024Updated 2 years ago
D-Star-AI / dsRAG
View on GitHub
High-performance retrieval engine for unstructured data
☆1,589Nov 10, 2025Updated 8 months ago
aurelio-labs / semantic-router
View on GitHub
Superfast AI decision making and intelligent processing of multi-modal data.
☆3,756Updated this week
neuml / txtai
View on GitHub
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
☆12,765Updated this week
567-labs / instructor
View on GitHub
structured outputs for llms
☆13,650Updated this week
illuin-tech / contextual-embeddings
View on GitHub
Model implementation for the contextual embeddings project
☆48Jun 2, 2025Updated last year
castorini / pyserini
View on GitHub
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
☆2,107Jul 16, 2026Updated last week
AmenRa / ranx
View on GitHub
⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍
☆689Aug 7, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
nlmatics / llmsherpa
View on GitHub
Developer APIs to Accelerate LLM Projects
☆1,745Oct 18, 2024Updated last year
oceanumeric / EnteRAG
View on GitHub
A RAG that can scale 🧑🏻‍💻
☆11May 28, 2024Updated 2 years ago
unicamp-dl / InRanker
View on GitHub
☆47Feb 7, 2024Updated 2 years ago
deepset-ai / haystack
View on GitHub
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…
☆26,051Updated this week
argilla-io / distilabel
View on GitHub
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆3,346Updated this week
axolotl-ai-cloud / axolotl
View on GitHub
Go ahead and axolotl questions
☆12,282Updated this week
jackboyla / GLiREL
View on GitHub
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
☆289Mar 30, 2026Updated 3 months ago