brandonstarxel / chunking_evaluation
This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation. It allows users to compare different chunking methods and includes implementations of several novel chunking strategies.
☆302 Updated last month
Alternatives and similar repositories for chunking_evaluation:
Users who are interested in chunking_evaluation are comparing it to the libraries listed below.
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆165 Updated 7 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆279 Updated 2 weeks ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆777 Updated 3 months ago
- ☆222 Updated 5 months ago
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks. ☆299 Updated last month
- Code for explaining and evaluating late chunking (chunked pooling) ☆377 Updated 4 months ago
- An example of multi-agent orchestration with llama-index ☆420 Updated 3 months ago
- ☆143 Updated 9 months ago
- ☆118 Updated last week
- Structured information extraction from documents ☆315 Updated 7 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,400 Updated last month
- ☆121 Updated 2 months ago
- Late Interaction Models Training & Retrieval ☆306 Updated this week
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆421 Updated last year
- ☆93 Updated 5 months ago
- FastAPI wrapper around DSPy ☆238 Updated last year
- ☆265 Updated 10 months ago
- An Awesome list of curated DSPy resources. ☆313 Updated 2 months ago
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations! ☆743 Updated this week
- Visualize Different Text Splitting Methods ☆251 Updated 4 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆144 Updated last year
- Automated Evaluation of RAG Systems ☆585 Updated last month
- ☆162 Updated 4 months ago
- ☆50 Updated last month
- An open-source tool for seamless migration from other LLMs to Llama, and for general prompt optimization. ☆244 Updated last week
- ☆113 Updated 2 weeks ago
- Readymade evaluators for agent trajectories ☆183 Updated this week
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆102 Updated last year
- 👩🏻‍🍳 A collection of example notebooks using Haystack ☆467 Updated last week
- ☆119 Updated 4 months ago