chonkie-inc / chonkie
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
★1,725, updated last week
Alternatives and similar repositories for chonkie
Users who are interested in chonkie are comparing it to the libraries listed below.
- Open source multi-modal RAG for building AI apps over private knowledge. (★2,784, updated this week)
- Data transformation framework for AI. Ultra performant, with incremental processing. (★2,159, updated this week)
- ContextGem: Effortless LLM extraction from documents (★1,239, updated this week)
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… (★1,147, updated 2 months ago)
- HelixDB is a powerful, open-source, graph-vector database built in Rust for intelligent data storage for RAG and AI. (★2,161, updated this week)
- Production-Ready MCP Server Framework • Build, deploy & scale secure AI agent infrastructure • Includes Auth, Observability, Debugger, Te… (★649, updated last week)
- Airweave lets agents search any app (★2,748, updated this week)
- ✨ Build a machine learning model from a prompt (★2,013, updated 3 weeks ago)
- PageIndex: Document Index System for Reasoning-based RAG (★1,092, updated this week)
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL (★1,033, updated last month)
- An MCP server that autonomously evaluates web applications. (★1,093, updated 2 weeks ago)
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨ (★2,186, updated this week)
- Fast State-of-the-Art Static Embeddings (★1,756, updated this week)
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web (★2,302, updated last month)
- The edge and AI gateway for agentic apps. Arch handles the messy low-level work in building agents like applying guardrails, routing prom… (★3,135, updated this week)
- Laminar - open-source all-in-one platform for engineering AI products. Create data flywheel for your AI app. Traces, Evals, Datasets, Lab… (★2,150, updated this week)
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers… (★815, updated last month)
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement… (★1,108, updated this week)
- Vision infrastructure to turn complex documents into RAG/LLM-ready data (★2,282, updated 2 weeks ago)
- Reasoning Augmented Generation (★860, updated 2 weeks ago)
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig… (★923, updated 5 months ago)
- (★1,256, updated last month)
- Building blocks for rapid development of GenAI applications (★1,523, updated this week)
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) int… (★633, updated 4 months ago)
- A system for agentic LLM-powered data processing and ETL (★2,354, updated last week)
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search. (★767, updated 2 weeks ago)
- Agentic testing for agentic codebases (★368, updated this week)
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. (★1,298, updated last month)
- A single interface to use and evaluate different agent frameworks (★571, updated this week)
- AI-first Search & Answer Engine for work. Open-source alternative to Glean. (★553, updated this week)