chonkie-inc / chonkie
🦛 CHONK docs with Chonkie ✨ - The lightweight ingestion library for fast, efficient and robust RAG pipelines
★3,419 · Updated this week
Alternatives and similar repositories for chonkie
Users who are interested in chonkie are comparing it to the libraries listed below.
- ContextGem: Effortless LLM extraction from documents ★1,747 · Updated last week
- The most accurate document search and store for building AI apps ★3,432 · Updated this week
- PageIndex: Document Index for Reasoning-based RAG ★4,435 · Updated last week
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ★1,410 · Updated 8 months ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨ ★2,709 · Updated this week
- A system for agentic LLM-powered data processing and ETL ★3,310 · Updated this week
- Python package and backend for the Elysia platform app. ★1,849 · Updated 2 weeks ago
- Data transformation framework for AI. Ultra performant, with incremental processing. Star if you like it! ★5,473 · Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ★2,924 · Updated 3 months ago
- Fast State-of-the-Art Static Embeddings ★1,959 · Updated last month
- RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL ★1,128 · Updated 2 weeks ago
- ✨ Build a machine learning model from a prompt ★2,287 · Updated 4 months ago
- Context retrieval for AI agents across apps and databases ★5,426 · Updated this week
- Communicate with an LLM provider using a single interface ★1,519 · Updated this week
- Open Source Machine Learning Research Platform designed for frontier AI/ML workflows. Local, on-prem, or in the cloud. Open source. ★4,626 · Updated last week
- ★2,233 · Updated last month
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. ★1,469 · Updated 4 months ago
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc.) int… ★732 · Updated 9 months ago
- Production-Ready MCP Server Framework • Build, deploy & scale secure AI agent infrastructure • Includes Auth, Observability, Debugger, Te… ★806 · Updated last week
- Building blocks for rapid development of GenAI applications ★1,601 · Updated last week
- The open-source RAG platform: built-in citations, deep research, 22+ file formats, partitions, MCP server, and more. ★1,748 · Updated this week
- Ship agents faster. Plano is delivery infrastructure for agentic applications. A models-native proxy server & dataplane that offloads the… ★4,689 · Updated this week
- HelixDB is an open-source graph-vector database built from scratch in Rust. ★3,530 · Updated this week
- AI Powered Knowledge Graph Generator ★1,417 · Updated this week
- A single interface to use and evaluate different agent frameworks ★1,055 · Updated last week
- Build, enrich, and transform datasets using AI models with no code ★1,610 · Updated 2 months ago
- Python library for Agentic Document Extraction from LandingAI ★2,308 · Updated 2 weeks ago
- Memory for AI Agents in 6 lines of code ★10,571 · Updated last week
- Running Docling as an API service ★1,080 · Updated 2 weeks ago
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/) ★1,824 · Updated 4 months ago