brandonstarxel / chunking_evaluationLinks
This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation. It allows users to compare different chunking methods and includes implementations of several novel chunking strategies.
☆319Updated 2 months ago
Alternatives and similar repositories for chunking_evaluation
Users that are interested in chunking_evaluation are comparing it to the libraries listed below
Sorting:
- Code for explaining and evaluating late chunking (chunked pooling)☆390Updated 5 months ago
- ☆224Updated 5 months ago
- 👩🏻🍳 A collection of example notebooks using Haystack☆476Updated this week
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆169Updated 8 months ago
- FastAPI wrapper around DSPy☆242Updated last year
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,436Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆787Updated 4 months ago
- An example of multi-agent orchestration with llama-index☆424Updated 4 months ago
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆315Updated 2 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆289Updated 2 weeks ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆422Updated last year
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆768Updated this week
- ☆131Updated last week
- ☆130Updated this week
- Semantic Chunker is a lightweight Python package for semantically-aware chunking and clustering of text.☆251Updated last month
- ☆173Updated 5 months ago
- An Awesome list of curated DSPy resources.☆326Updated 3 months ago
- This software contains an agent based on LangGraph & LangChain for solving general requests in the Whatsapp channel of this medical clini…☆200Updated 7 months ago
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)☆320Updated last month
- Late Interaction Models Training & Retrieval☆385Updated this week
- Readymade evaluators for agent trajectories☆222Updated last week
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆144Updated last year
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)☆423Updated 8 months ago
- Automated Evaluation of RAG Systems☆596Updated 2 months ago
- Visualize Different Text Splitting Methods☆258Updated 5 months ago
- This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated …☆1,231Updated 2 weeks ago
- ☆122Updated 3 months ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…☆239Updated 7 months ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,175Updated this week
- ☆185Updated last year