chonkie-inc / chonkie
🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines
☆3,326 Updated this week
Alternatives and similar repositories for chonkie
Users who are interested in chonkie are comparing it to the libraries listed below:
- ContextGem: Effortless LLM extraction from documents ☆1,738 Updated 3 weeks ago
- The most accurate document search and store for building AI apps ☆3,404 Updated this week
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,388 Updated 7 months ago
- 📑 PageIndex: Document Index for Reasoning-based RAG ☆4,248 Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,915 Updated 2 months ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨ ☆2,672 Updated this week
- Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it! ☆3,605 Updated this week
- Fast State-of-the-Art Static Embeddings ☆1,939 Updated 3 weeks ago
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL ☆1,119 Updated this week
- Python package and backend for the Elysia platform app. ☆1,820 Updated this week
- A system for agentic LLM-powered data processing and ETL ☆3,204 Updated last week
- RAG that intelligently adapts to your use case, data, and queries ☆3,611 Updated last month
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/) ☆1,809 Updated 3 months ago
- Python library for Agentic Document Extraction from LandingAI ☆2,299 Updated 2 weeks ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. ☆1,457 Updated 3 months ago
- ✨ Build a machine learning model from a prompt ☆2,277 Updated 3 months ago
- Production-Ready MCP Server Framework • Build, deploy & scale secure AI agent infrastructure • Includes Auth, Observability, Debugger, Te… ☆800 Updated 2 weeks ago
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your … ☆4,575 Updated last week
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) int… ☆729 Updated 9 months ago
- A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evo… ☆1,489 Updated 2 weeks ago
- The open-source RAG platform: built-in citations, deep research, 22+ file formats, partitions, MCP server, and more. ☆1,661 Updated this week
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web ☆2,324 Updated 6 months ago
- Building blocks for rapid development of GenAI applications ☆1,595 Updated last week
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025. ☆7,857 Updated last month
- For your multi-agent needs ☆1,303 Updated last week
- Empowering RAG with a memory-based data interface for all-purpose applications! ☆2,180 Updated 3 months ago
- Agentic testing for agentic codebases ☆666 Updated last week
- ☆2,072 Updated 8 months ago
- Delivery infrastructure for agents. Arch is a models-native proxy server that handles the plumbing work in AI: agent routing & orchestrat… ☆4,487 Updated last week
- Context retrieval for AI agents across apps and databases ☆5,335 Updated this week