AnswerDotAI / rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆790Updated last week
Related projects: ⓘ
- Fast lexical search library implementing BM25 in Python using Numpy and Scipy☆767Updated this week
- The code used to train and run inference with the ColPali architecture.☆502Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆1,396Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆318Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 💯☆745Updated last week
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆465Updated last week
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro…☆571Updated 2 weeks ago
- ☆436Updated last month
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆655Updated last week
- Curate better data for LLMs☆934Updated 6 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆456Updated last year
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆1,371Updated this week
- LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processin…☆659Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆1,935Updated last week
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆362Updated 7 months ago
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval☆847Updated 2 weeks ago
- Efficient Retrieval Augmentation and Generation Framework☆1,255Updated last week
- High-performance retrieval engine for unstructured data☆778Updated this week
- Easily embed, cluster and semantically label text datasets☆433Updated 5 months ago
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤☆792Updated last month
- Automatically evaluate your LLMs in Google Colab☆511Updated 4 months ago
- Best practices for distilling large language models.☆370Updated 7 months ago
- ReFT: Representation Finetuning for Language Models☆1,076Updated 2 weeks ago
- ☆640Updated this week
- Generative Representational Instruction Tuning☆525Updated 2 weeks ago
- ☆418Updated 2 months ago
- An LLM-powered advanced RAG pipeline built from scratch☆785Updated 7 months ago
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆266Updated 2 months ago
- ☆772Updated 10 months ago
- Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models and clip☆1,261Updated 2 weeks ago