chonkie-inc / chonkieLinks
π¦ CHONK your texts with Chonkie β¨ β The no-nonsense RAG chunking library
β2,332Updated this week
Alternatives and similar repositories for chonkie
Users that are interested in chonkie are comparing it to the libraries listed below
Sorting:
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has stβ¦β1,263Updated 4 months ago
- ππ§ PageIndex: Document Index for Reasoning-based RAGβ2,516Updated 2 weeks ago
- ContextGem: Effortless LLM extraction from documentsβ1,500Updated last week
- Data transformation framework for AI. Ultra performant, with incremental processing.β2,813Updated this week
- The most accurate document search and store for building AI appsβ3,235Updated this week
- π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQLβ1,076Updated this week
- Fast State-of-the-Art Static Embeddingsβ1,838Updated last week
- A system for agentic LLM-powered data processing and ETLβ2,832Updated last week
- Python package and backend for the Elysia platform app.β1,623Updated last week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.β1,404Updated 3 weeks ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization β¨β2,466Updated this week
- Production-Ready MCP Server Framework β’ Build, deploy & scale secure AI agent infrastructure β’ Includes Auth, Observability, Debugger, Teβ¦β773Updated last week
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) intβ¦β690Updated 6 months ago
- Building blocks for rapid development of GenAI applicationsβ1,571Updated last week
- HelixDB is a database built from scratch to be the backend for any AI application.β2,508Updated this week
- A single interface to use and evaluate different agent frameworksβ937Updated this week
- β¨ Build a machine learning model from a promptβ2,179Updated last month
- Communicate with an LLM provider using a single interfaceβ976Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,842Updated last month
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and versβ¦β935Updated 3 months ago
- Reasoning Augmented Generationβ879Updated 2 months ago
- Python library for Agentic Document Extraction from LandingAIβ1,876Updated last week
- The smart edge and AI gateway for agents. Arch is a high-performance proxy server that handles the low-level work in building agents: likβ¦β3,670Updated last week
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search.β795Updated this week
- Airweave lets agents search any appβ2,877Updated this week
- RAG that intelligently adapts to your use case, data, and queriesβ3,518Updated 2 months ago
- Agentic testing for agentic codebasesβ590Updated last week
- Running Docling as an API serviceβ706Updated last week
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your β¦β4,285Updated last week
- Semantic search and document parsing tools for the command lineβ955Updated this week