This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation. It allows users to compare different chunking methods and includes implementations of several novel chunking strategies.
☆484Dec 13, 2025Updated 4 months ago
Alternatives and similar repositories for chunking_evaluation
Users that are interested in chunking_evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Overview of the Latest Document Chunking Research☆89Nov 25, 2024Updated last year
- ☆33Jun 17, 2024Updated last year
- Applying domain specific evaluations to RAG chunking and embedding functions☆18Dec 25, 2024Updated last year
- An overview of popular reranking models and architectures for 2 stage RAG pipelines☆21Jun 10, 2025Updated 10 months ago
- Fast BM25 search in Python, powered by Numpy and Numba☆1,622Apr 5, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Query Only Linear Adapter Training for Fine Tuned Embedding Model Query Representation☆28Sep 12, 2024Updated last year
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 10 months ago
- ☆12Updated this week
- Code for explaining and evaluating late chunking (chunked pooling)☆499Dec 23, 2024Updated last year
- Fork of OpenAI's Realtime Console, adapted for Vocal RAG☆36Oct 18, 2024Updated last year
- ☆21Nov 26, 2024Updated last year
- Optimize Document Retrieval with Fine-Tuned KnowledgeBases☆184Nov 5, 2025Updated 5 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 9 months ago
- ☆1,456Jun 18, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Evaluation framework for document processing models and services.☆67Apr 2, 2026Updated 2 weeks ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…