brandonstarxel / chunking_evaluationLinks
This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation. It allows users to compare different chunking methods and includes implementations of several novel chunking strategies.
☆454Updated 8 months ago
Alternatives and similar repositories for chunking_evaluation
Users that are interested in chunking_evaluation are comparing it to the libraries listed below
Sorting:
- ☆241Updated 5 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆832Updated 10 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆343Updated 6 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆469Updated 11 months ago
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆919Updated last week
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆179Updated last year
- 👩🏻🍳 A collection of example notebooks using Haystack☆513Updated last week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,579Updated 6 months ago
- Visualize Different Text Splitting Methods☆308Updated 11 months ago
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆486Updated last month
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro…☆892Updated 2 months ago
- ☆209Updated 2 months ago
- Automated Evaluation of RAG Systems☆676Updated 8 months ago
- Simple package to extract text with coordinates from programmatic PDFs☆218Updated last month
- An example of multi-agent orchestration with llama-index☆441Updated 10 months ago
- An Awesome list of curated DSPy resources.☆479Updated last month
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆114Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆155Updated this week
- FastAPI wrapper around DSPy☆284Updated last year
- Readymade evaluators for agent trajectories☆404Updated 3 months ago
- A small library of LLM judges☆301Updated 4 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆443Updated last year
- HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems (WWW 2025)☆450Updated 5 months ago
- ☆1,409Updated last year
- RAG evaluation without the need for "golden answers"☆328Updated this week
- ☆275Updated last year
- ☆199Updated 2 weeks ago
- Build fast and accurate GenAI apps with GraphRAG SDK at scale.☆514Updated last week
- Build datasets using natural language☆548Updated 2 months ago
- Simple UI for debugging correlations of text embeddings☆302Updated 6 months ago