chonkie-inc / chonkieLinks
π¦ CHONK your texts with Chonkie β¨ β The no-nonsense RAG chunking library
β2,076Updated this week
Alternatives and similar repositories for chonkie
Users that are interested in chonkie are comparing it to the libraries listed below
Sorting:
- The most accurate document search and store for building AI appsβ3,152Updated this week
- ContextGem: Effortless LLM extraction from documentsβ1,477Updated this week
- ππ§ PageIndex: Document Index for Reasoning-based RAGβ1,281Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has stβ¦β1,255Updated 4 months ago
- Data transformation framework for AI. Ultra performant, with incremental processing.β2,709Updated this week
- Fast State-of-the-Art Static Embeddingsβ1,807Updated 2 weeks ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization β¨β2,438Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,796Updated last week
- π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQLβ1,056Updated this week
- A system for agentic LLM-powered data processing and ETLβ2,722Updated this week
- Communicate with an LLM provider using a single interfaceβ950Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.β1,388Updated this week
- β¨ Build a machine learning model from a promptβ2,160Updated last week
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and versβ¦β921Updated 3 months ago
- A single interface to use and evaluate different agent frameworksβ915Updated this week
- Reasoning Augmented Generationβ875Updated last month
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the webβ2,319Updated 2 months ago
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) intβ¦β680Updated 5 months ago
- Building blocks for rapid development of GenAI applicationsβ1,560Updated this week
- HelixDB is a database built from scratch to be the backend for any AI application.β2,369Updated this week
- Agentic testing for agentic codebasesβ575Updated 2 weeks ago
- Production-Ready MCP Server Framework β’ Build, deploy & scale secure AI agent infrastructure β’ Includes Auth, Observability, Debugger, Teβ¦β741Updated last week
- Python package and backend for the Elysia platform app.β753Updated this week
- Pixeltable β AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.β741Updated last week
- Python library for Agentic Document Extraction from LandingAIβ1,812Updated last week
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your β¦β3,826Updated last week
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search.β776Updated this week
- Airweave lets agents search any appβ2,859Updated last week
- OCR Benchmarkβ553Updated 3 months ago
- An MCP server that autonomously evaluates web applications.β1,160Updated this week