brandonstarxel / chunking_evaluationLinks

This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation. It allows users to compare different chunking methods and includes implementations of several novel chunking strategies.

☆373

Alternatives and similar repositories for chunking_evaluation

Users that are interested in chunking_evaluation are comparing it to the libraries listed below

Sorting:

aurelio-labs / semantic-chunkers
☆231Updated last month
tonywu71 / colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
☆318Updated 2 months ago
AnswerDotAI / byaldi
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
☆807Updated 6 months ago
jina-ai / late-chunking
Code for explaining and evaluating late chunking (chunked pooling)
☆427Updated 7 months ago
isaacus-dev / semchunk
A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.
☆348Updated last month
deepset-ai / haystack-cookbook
👩🏻‍🍳 A collection of example notebooks using Haystack
☆490Updated this week
gkamradt / ChunkViz
Visualize Different Text Splitting Methods
☆283Updated 7 months ago
weaviate / recipes
This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!
☆814Updated this week
stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…
☆173Updated 10 months ago
KarelDO / xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
☆434Updated last year
ganarajpr / awesome-dspy
An Awesome list of curated DSPy resources.
☆390Updated 5 months ago
diicellman / dspy-rag-fastapi
FastAPI wrapper around DSPy
☆258Updated last year
stanford-futuredata / ARES
Automated Evaluation of RAG Systems
☆637Updated 4 months ago
predlico / ARAGOG
ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…
☆108Updated last year
quotient-ai / judges
A small library of LLM judges
☆248Updated this week
run-llama / multi-agent-concierge
An example of multi-agent orchestration with llama-index
☆429Updated 6 months ago
567-labs / systematically-improving-rag
☆156Updated this week
AnswerDotAI / rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,505Updated 2 months ago
lightonai / pylate
Late Interaction Models Training & Retrieval
☆521Updated 2 weeks ago
huggingface / yourbench
🤗 Benchmark Large Language Models Reliably On Your Data
☆367Updated this week
langchain-ai / agentevals
Readymade evaluators for agent trajectories
☆285Updated last week
pavanjava / bootstrap-rag
this project will bootstrap and scaffold the projects for specific semantic search and RAG applications along with regular boiler plate c…
☆91Updated 7 months ago
langchain-ai / memory-template
☆174Updated 2 months ago
docling-project / docling-ibm-models
☆132Updated 2 weeks ago
langchain-ai / memory-agent
☆216Updated 7 months ago
simbianai / taskgen
Task-based Agentic Framework using StrictJSON as the core
☆455Updated 3 weeks ago
CYQIQ / MultiCoT
Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph
☆147Updated last year
jina-ai / correlations
Simple UI for debugging correlations of text embeddings
☆288Updated 2 months ago
TonicAI / tonic_validate
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.
☆315Updated 3 weeks ago
jxnl / n-levels-of-rag
☆195Updated last year