🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines
☆4,179Jun 24, 2026Updated this week
Alternatives and similar repositories for chonkie
Users that are interested in chonkie are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The most accurate document search and store for building AI apps☆3,621Jun 19, 2026Updated last week
- Open-source context retrieval layer for AI agents☆6,453Jun 5, 2026Updated 3 weeks ago
- Get your documents ready for gen AI☆62,000Updated this week
- HelixDB is an OLTP graph-vector database built in Rust.☆5,501Updated this week
- ContextGem: Effortless LLM extraction from documents☆1,851Jun 6, 2026Updated 3 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Universal memory layer for AI Agents☆59,199Jun 22, 2026Updated last week
- Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.☆28,866Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,892Nov 7, 2025Updated 7 months ago
- The LLM Evaluation Framework☆16,384Jun 21, 2026Updated last week
- Build Real-Time Knowledge Graphs for AI Agents☆28,071Updated this week
- Build, run, and manage agent platforms.☆40,861Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,673Jun 19, 2026Updated last week
- Incremental engine for long horizon agents 🌟 Star if you like it!☆10,503Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆51,475Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆15,002Updated this week
- Fast State-of-the-Art Static Embeddings☆2,132Jun 6, 2026Updated 3 weeks ago
- The SDK For Browser Agents☆23,230Updated this week
- DSPy: The framework for programming—not prompting—language models☆35,310Jun 18, 2026Updated last week
- Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full…☆19,035Updated this week
- An MCP server that autonomously evaluates web applications.☆1,242Feb 11, 2026Updated 4 months ago
- AI Agent Framework, the Pydantic way☆17,991Updated this week
- Structured Outputs☆13,984Jun 19, 2026Updated last week
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed not…☆28,225Jun 17, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An open-source RAG-based tool for chatting with your documents.☆25,500Jun 9, 2026Updated 2 weeks ago
- RAG that intelligently adapts to your use case, data, and queries☆3,812Nov 1, 2025Updated 7 months ago
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…☆36,942May 21, 2026Updated last month
- 🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenT…☆29,792Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,398Feb 21, 2025Updated last year
- Supercharge Your LLM Application Evaluations 🚀☆14,523Feb 24, 2026Updated 4 months ago
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste…☆5,107Jun 20, 2026Updated last week
- ✨ Build a machine learning model from a prompt☆2,584Mar 6, 2026Updated 3 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,950Apr 9, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A system for agentic LLM-powered data processing and ETL☆3,841Jun 17, 2026Updated last week
- High-performance retrieval engine for unstructured data☆1,586Nov 10, 2025Updated 7 months ago
- Python tool for converting files and office documents to Markdown.☆159,614Updated this week
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆9,401Oct 16, 2025Updated 8 months ago
- Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!☆11,928Nov 24, 2025Updated 7 months ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆69,339Jun 18, 2026Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆34,026Jun 22, 2026Updated last week