chonkie-inc / chonkieLinks
π¦ CHONK docs with Chonkie β¨ β The lightweight ingestion library for fast, efficient and robust RAG pipelines
β3,203Updated this week
Alternatives and similar repositories for chonkie
Users that are interested in chonkie are comparing it to the libraries listed below
Sorting:
- The most accurate document search and store for building AI appsβ3,369Updated this week
- ContextGem: Effortless LLM extraction from documentsβ1,718Updated this week
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization β¨β2,622Updated this week
- π PageIndex: Document Index for Reasoning-based RAGβ3,997Updated last week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has stβ¦β1,370Updated 6 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,908Updated last month
- Fast State-of-the-Art Static Embeddingsβ1,900Updated this week
- Python package and backend for the Elysia platform app.β1,806Updated 2 weeks ago
- β¨ Build a machine learning model from a promptβ2,273Updated 3 months ago
- Data transformation framework for AI. Ultra performant, with incremental processing. π Star if you like it!β3,321Updated this week
- Production-Ready MCP Server Framework β’ Build, deploy & scale secure AI agent infrastructure β’ Includes Auth, Observability, Debugger, Teβ¦β797Updated last week
- Context retrieval for AI agents across apps and databasesβ5,179Updated this week
- A system for agentic LLM-powered data processing and ETLβ3,065Updated this week
- HelixDB is an open-source graph-vector database built from scratch in Rust.β3,328Updated this week
- π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQLβ1,104Updated 2 weeks ago
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) intβ¦β724Updated 8 months ago
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the webβ2,321Updated 5 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.β1,455Updated 2 months ago
- A single interface to use and evaluate different agent frameworksβ1,026Updated this week
- Agentic testing for agentic codebasesβ634Updated last week
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraβ¦β2,760Updated this week
- RAG that intelligently adapts to your use case, data, and queriesβ3,591Updated 2 weeks ago
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detectionβ1,062Updated this week
- The data plane for agents. Arch is a models-native proxy server that handles the plumbing work in AI: agent routing & hand off, guardrailβ¦β4,348Updated this week
- Building blocks for rapid development of GenAI applicationsβ1,589Updated this week
- Reasoning Augmented Generationβ889Updated 4 months ago
- π₯ Reliable Browser AI Agents (YC S25)β1,666Updated last week
- Build, enrich, and transform datasets using AI models with no codeβ1,564Updated 3 weeks ago
- Semantic search and document parsing tools for the command lineβ1,455Updated last week
- AI Powered Knowledge Graph Generatorβ1,367Updated last month