Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.
☆954Jan 1, 2026Updated 2 months ago
Alternatives and similar repositories for FlashRank
Users that are interested in FlashRank are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,605Dec 20, 2025Updated 3 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Aug 24, 2024Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,889May 17, 2025Updated 10 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆120Mar 31, 2025Updated 11 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,724Feb 5, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,791Mar 12, 2026Updated 2 weeks ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆583Mar 12, 2026Updated 2 weeks ago
- Fast lexical search implementing BM25 in Python☆1,596Mar 17, 2026Updated last week
- Late Interaction Models Training & Retrieval☆754Mar 6, 2026Updated 3 weeks ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,811Oct 14, 2025Updated 5 months ago
- Supercharge Your LLM Application Evaluations 🚀☆13,106Feb 24, 2026Updated last month
- Retrieval and Retrieval-augmented LLMs☆11,443Mar 10, 2026Updated 2 weeks ago
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆984May 3, 2024Updated last year
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Mar 4, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,961Mar 19, 2026Updated last week
- Structured Outputs☆13,588Updated this week
- Fast State-of-the-Art Static Embeddings☆2,017Mar 12, 2026Updated 2 weeks ago
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,282Mar 16, 2026Updated last week
- DSPy: The framework for programming—not prompting—language models☆33,038Updated this week
- A blazing fast inference solution for text embeddings models☆4,625Updated this week
- Efficient few-shot learning with Sentence Transformers☆2,699Dec 11, 2025Updated 3 months ago
- Superfast AI decision making and intelligent processing of multi-modal data.☆3,381Mar 12, 2026Updated 2 weeks ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆160Jul 14, 2025Updated 8 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆259Jun 11, 2025Updated 9 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆209Aug 31, 2024Updated last year
- High-performance retrieval engine for unstructured data☆1,567Nov 10, 2025Updated 4 months ago
- Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]☆658Mar 10, 2024Updated 2 years ago
- structured outputs for llms☆12,589Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,322Mar 20, 2026Updated last week
- Developer APIs to Accelerate LLM Projects☆1,749Oct 18, 2024Updated last year
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- Generalist and Lightweight Model for Text Classification☆200Feb 17, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,563Mar 17, 2026Updated last week
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆2,040Updated this week
- ☆47Feb 7, 2024Updated 2 years ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,131Mar 16, 2026Updated last week
- ⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍☆665Aug 7, 2025Updated 7 months ago
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…☆24,585Mar 20, 2026Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆40,834Updated this week