brandonstarxel / chunking_evaluation
This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation. It allows users to compare different chunking methods and includes implementations of several novel chunking strategies.
☆258Updated 5 months ago
Alternatives and similar repositories for chunking_evaluation:
Users that are interested in chunking_evaluation are comparing it to the libraries listed below
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆750Updated last month
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆410Updated last year
- ☆106Updated last week
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆157Updated 5 months ago
- 👩🏻🍳 A collection of example notebooks☆440Updated this week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆260Updated 2 months ago
- ☆195Updated 10 months ago
- FastAPI wrapper around DSPy☆234Updated last year
- ☆212Updated 3 months ago
- An Awesome list of curated DSPy resources.☆294Updated 3 weeks ago
- ☆142Updated 7 months ago
- A Lightweight Library for AI Observability☆236Updated 3 weeks ago
- This software contains an agent based on LangGraph & LangChain for solving general requests in the Whatsapp channel of this medical clini…☆190Updated 5 months ago
- Visualize Different Text Splitting Methods☆230Updated 2 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆144Updated 11 months ago
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆712Updated this week
- Code for explaining and evaluating late chunking (chunked pooling)☆340Updated 2 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆102Updated 10 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,327Updated 3 weeks ago
- ☆263Updated 8 months ago
- Generate large synthetic data using an LLM☆390Updated this week
- Tutorial for building LLM router☆186Updated 7 months ago
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆259Updated this week
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆166Updated 10 months ago
- ☆88Updated 3 months ago
- ☆149Updated 3 months ago
- A simple Python sandbox for helpful LLM data agents☆229Updated 8 months ago
- An example of multi-agent orchestration with llama-index☆405Updated last month
- A small library of LLM judges☆154Updated this week
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)☆397Updated 5 months ago