chonkie-ai / chonkie
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
★2,599 · Updated this week
Alternatives and similar repositories for chonkie:
Users that are interested in chonkie are comparing it to the libraries listed below
- RAG that intelligently adapts to your use case, data, and queries ★2,924 · Updated last week
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent… ★2,533 · Updated this week
- A system for agentic LLM-powered data processing and ETL ★1,669 · Updated this week
- Vision model based document ingestion ★1,647 · Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite ★807 · Updated this week
- High-performance retrieval engine for unstructured data ★1,165 · Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps. ★1,565 · Updated this week
- Build and query dynamic, temporally-aware Knowledge Graphs ★1,915 · Updated last week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid… ★2,274 · Updated this week
- A toolkit to create optimal Production-ready Retrieval Augmented Generation (RAG) setup for your data ★1,383 · Updated 2 weeks ago
- Fast State-of-the-Art Static Embeddings ★1,060 · Updated this week
- This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems. ★1,697 · Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ★766 · Updated 2 weeks ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ★1,295 · Updated last week
- The official Python SDK for Model Context Protocol servers and clients ★1,888 · Updated this week
- The most advanced AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API. ★4,880 · Updated this week
- The fast, Pythonic way to build Model Context Protocol servers 🚀 ★1,018 · Updated last month
- Empowering RAG with a memory-based data interface for all-purpose applications! ★1,633 · Updated 2 months ago
- AI-native (edge and LLM) proxy for agents. Handles all the pesky heavy lifting in building agentic apps -- fast ⚡️ query routing, seamle… ★1,648 · Updated this week
- Deploy your agentic workflows to production ★1,964 · Updated this week
- A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility. ★886 · Updated last month
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig… ★898 · Updated 3 weeks ago
- Everything about the SmolLM2 and SmolVLM family of models ★1,888 · Updated 2 weeks ago
- 🦾 Take control of your AI agents ★1,169 · Updated 2 weeks ago
- Things you can do with the token embeddings of an LLM ★1,424 · Updated 2 weeks ago
- Knowledge Agents and Management in the Cloud ★3,707 · Updated this week
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol. ★1,492 · Updated this week
- Building AI agents, atomically ★2,652 · Updated this week
- Desktop app for prototyping and debugging LangGraph applications locally. ★2,479 · Updated 3 weeks ago
- Implementing the 4 agentic patterns from scratch ★1,027 · Updated 3 weeks ago