chonkie-ai / chonkie
π¦ CHONK your texts with Chonkie β¨ - The no-nonsense RAG chunking library
β2,818Updated this week
Alternatives and similar repositories for chonkie:
Users that are interested in chonkie are comparing it to the libraries listed below
- RAG that intelligently adapts to your use case, data, and queriesβ3,042Updated 3 weeks ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,017Updated this week
- High-performance retrieval engine for unstructured dataβ1,272Updated this week
- Build and query dynamic, temporally-aware Knowledge Graphsβ2,478Updated this week
- Improved file parsing for LLMβsβ2,866Updated 4 months ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidβ¦β2,465Updated this week
- A system for agentic LLM-powered data processing and ETLβ1,718Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.β1,338Updated last month
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other entβ¦β2,604Updated this week
- Empowering RAG with a memory-based data interface for all-purpose applications!β1,690Updated 3 weeks ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.β1,614Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your dataβ1,400Updated last month
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning aβ¦β6,064Updated this week
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automationβ3,705Updated 3 weeks ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has stβ¦β859Updated last month
- Fast State-of-the-Art Static Embeddingsβ1,109Updated 3 weeks ago
- Task-Aware Agent-driven Prompt Optimization Frameworkβ3,002Updated this week
- A simple, easy-to-hack GraphRAG implementationβ2,681Updated this week
- Knowledge Agents and Management in the Cloudβ3,791Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embeddingβ1,882Updated this week
- Implementing the 4 agentic patterns from scratchβ1,119Updated this week
- π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLiteβ868Updated this week
- Everything about the SmolLM2 and SmolVLM family of modelsβ2,035Updated last week
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge aβ¦β2,051Updated 2 weeks ago
- This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated β¦β1,096Updated 2 weeks ago
- The Open Source Memory Layer For Autonomous Agentsβ2,041Updated 5 months ago
- The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)β2,650Updated this week
- Local realtime voice AIβ2,260Updated 3 weeks ago
- Deploy your agentic worfklows to productionβ1,981Updated 2 weeks ago
- This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.β1,789Updated last month