🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines
☆3,911Apr 8, 2026Updated this week
Alternatives and similar repositories for chonkie
Users that are interested in chonkie are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The most accurate document search and store for building AI apps☆3,557Apr 2, 2026Updated last week
- 🦛 CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library☆324Mar 28, 2026Updated last week
- Open-source context retrieval layer for AI agents☆6,208Updated this week
- Get your documents ready for gen AI☆57,163Updated this week
- HelixDB is an open-source graph-vector database built from scratch in Rust.☆4,041Mar 31, 2026Updated last week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ContextGem: Effortless LLM extraction from documents☆1,822Mar 16, 2026Updated 3 weeks ago
- Universal memory layer for AI Agents☆52,137Updated this week
- Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.☆27,472Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,753Nov 7, 2025Updated 5 months ago
- The LLM Evaluation Framework☆14,519Updated this week
- Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!☆6,707Apr 2, 2026Updated last week
- Build Real-Time Knowledge Graphs for AI Agents☆24,507Updated this week
- Build, run, manage agentic software at scale.☆39,153Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,368Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆42,652Updated this week
- Fast State-of-the-Art Static Embeddings☆2,020Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,383Updated this week
- The SDK For Browser Agents☆21,897Updated this week
- Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full…☆13,370Updated this week
- DSPy: The framework for programming—not prompting—language models☆33,495Apr 2, 2026Updated last week
- An MCP server that autonomously evaluates web applications.☆1,236Feb 11, 2026Updated last month
- AI Agent Framework, the Pydantic way☆16,185Updated this week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open…☆24,270Apr 2, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Structured Outputs☆13,631Mar 26, 2026Updated 2 weeks ago
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information r…☆26,472Feb 17, 2026Updated last month
- An open-source RAG-based tool for chatting with your documents.☆25,251Updated this week
- Python tool for converting files and office documents to Markdown.☆93,259Mar 30, 2026Updated last week
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…☆35,501Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆3,758Nov 1, 2025Updated 5 months ago
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste…☆4,856Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,343Feb 21, 2025Updated last year
- ✨ Build a machine learning model from a prompt☆2,554Mar 6, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Supercharge Your LLM Application Evaluations 🚀☆13,295Feb 24, 2026Updated last month
- High-performance retrieval engine for unstructured data☆1,571Nov 10, 2025Updated 4 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,939Sep 24, 2025Updated 6 months ago
- A system for agentic LLM-powered data processing and ETL☆3,702Mar 27, 2026Updated last week
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆8,710Oct 16, 2025Updated 5 months ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆63,500Updated this week
- Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!☆11,771Nov 24, 2025Updated 4 months ago