🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines
☆3,809Mar 4, 2026Updated last week
Alternatives and similar repositories for chonkie
Users that are interested in chonkie are comparing it to the libraries listed below
Sorting:
- The most accurate document search and store for building AI apps☆3,529Feb 25, 2026Updated 2 weeks ago
- Open-source context retrieval layer for AI agents☆5,977Updated this week
- Get your documents ready for gen AI☆55,513Updated this week
- ContextGem: Effortless LLM extraction from documents☆1,808Feb 22, 2026Updated 2 weeks ago
- Universal memory layer for AI Agents☆49,365Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,720Nov 7, 2025Updated 4 months ago
- 🦛 CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library☆317Mar 5, 2026Updated last week
- Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!☆6,304Updated this week
- Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.☆26,860Updated this week
- HelixDB is an open-source graph-vector database built from scratch in Rust.☆3,889Mar 5, 2026Updated last week
- The LLM Evaluation Framework☆13,984Updated this week
- Build, run, manage agentic software at scale.☆38,516Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,266Mar 4, 2026Updated last week
- The AI Browser Automation Framework☆21,356Mar 5, 2026Updated last week
- Build Real-Time Knowledge Graphs for AI Agents☆23,438Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆37,994Updated this week
- Fast State-of-the-Art Static Embeddings☆2,008Feb 28, 2026Updated last week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,135Mar 4, 2026Updated last week
- Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full…☆12,897Updated this week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open…☆22,717Updated this week
- DSPy: The framework for programming—not prompting—language models☆32,696Updated this week
- Structured Outputs☆13,539Updated this week
- An open-source RAG-based tool for chatting with your documents.☆25,193Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,343Feb 21, 2025Updated last year
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste…☆4,821Updated this week
- GenAI Agent Framework, the Pydantic way☆15,256Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆3,718Nov 1, 2025Updated 4 months ago
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information r…☆25,832Feb 17, 2026Updated 3 weeks ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,939Sep 24, 2025Updated 5 months ago
- Python tool for converting files and office documents to Markdown.☆90,316Feb 20, 2026Updated 2 weeks ago
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation☆4,601Dec 23, 2025Updated 2 months ago
- A system for agentic LLM-powered data processing and ETL☆3,682Updated this week
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…☆34,507Feb 25, 2026Updated 2 weeks ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆61,687Updated this week
- Supercharge Your LLM Application Evaluations 🚀☆12,826Feb 24, 2026Updated 2 weeks ago
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,…☆18,073Updated this week
- The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.☆3,906Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,392Mar 1, 2026Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆31,296Updated this week