chonkie-inc / chonkie
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
★1,538 · Updated this week
Alternatives and similar repositories for chonkie
Users who are interested in chonkie are comparing it to the libraries listed below.
- Open source multi-modal RAG for building AI apps over private knowledge. ★2,696 · Updated this week
- ContextGem: Effortless LLM extraction from documents ★1,191 · Updated this week
- Production-Ready MCP Server Framework • Build, deploy & scale secure AI agent infrastructure • Includes Auth, Observability, Debugger, Te… ★628 · Updated last week
- Airweave lets agents search any app ★2,659 · Updated this week
- An MCP server that autonomously evaluates web applications. ★1,035 · Updated 3 weeks ago
- HelixDB is a powerful, open-source, graph-vector database built in Rust for intelligent data storage for RAG and AI. ★2,075 · Updated this week
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web ★2,274 · Updated 2 weeks ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨ ★2,090 · Updated this week
- ✨ Build a machine learning model from a prompt ★1,987 · Updated this week
- Real-time data transformation framework for AI. Ultra performant, with incremental processing. ★1,947 · Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL ★1,021 · Updated last week
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ★1,138 · Updated last month
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ★2,224 · Updated this week
- Fast State-of-the-Art Static Embeddings ★1,740 · Updated 2 weeks ago
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) int… ★622 · Updated 3 months ago
- Open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig… ★920 · Updated 4 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. ★1,284 · Updated 2 weeks ago
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search. ★758 · Updated this week
- The toolkit for codebase mapping, symbol extraction, and many kinds of code search. Build AI-powered devtools ★520 · Updated this week
- PageIndex: Document Index System for Reasoning-based RAG ★1,066 · Updated last week
- Reasoning Augmented Generation ★855 · Updated 4 months ago
- Unified Backend Framework for APIs, Events, and AI Agents ★2,250 · Updated this week
- Self-hosted, multi-user API that drops bots into Google Meet for real-time transcripts. ★1,017 · Updated this week
- Your memories are in ChatGPT... But nowhere else. Universal Memory MCP makes your memories available to every single LLM. No logins or pa… ★1,011 · Updated last week
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers… ★636 · Updated last month
- Sim Studio is an open-source AI agent workflow builder. Sim Studio's interface is a lightweight, intuitive way to quickly build and deplo… ★4,013 · Updated this week
- The open-source alternative to Carbon.ai. Build powerful RAG applications with any data source, at any scale. ★622 · Updated 2 weeks ago
- Building blocks for rapid development of GenAI applications ★1,370 · Updated last week
- A system for agentic LLM-powered data processing and ETL ★2,273 · Updated last week
- OCR Benchmark ★511 · Updated 3 weeks ago