π¦ CHONK your texts with Chonkie β¨ - The no-nonsense RAG chunking library
β42Nov 8, 2024Updated last year
Alternatives and similar repositories for chonkie
Users that are interested in chonkie are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Curated examples and patterns for using Chalk. Use these to build your feature pipelines.β27Apr 23, 2026Updated last week
- An easy-to-use library and command-line tool for TTSβ15May 3, 2025Updated last year
- Personal voice assistant, with voice interruption and Twilio supportβ18Feb 24, 2025Updated last year
- LLM-Powered Data Discovery System for Tabular Dataβ28Apr 7, 2026Updated 3 weeks ago
- Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable wiβ¦β26Jun 7, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Pytest plugin type-checking tests, fixtures, and/or your codebase with @beartype.β23Apr 15, 2026Updated 2 weeks ago
- Svelte stores and components to query data (with realtime updates) from PocketBaseβ15Feb 9, 2026Updated 2 months ago
- A Rust library for programmatically generating LaTeX documentsβ13Apr 17, 2025Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMsβ11Jul 22, 2023Updated 2 years ago
- β17Feb 24, 2025Updated last year
- β19Feb 14, 2026Updated 2 months ago
- Recursive Self-Aggregation evals on ARC-AGIβ33Jan 26, 2026Updated 3 months ago
- A Python package designed to simplify the process of creating and managing function calls to OpenAI's API, as well as models using LiteLLβ¦β17May 25, 2025Updated 11 months ago
- β26Jan 25, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- C inference engine for running GLiClass (Generalist and Lightweight Classification) modelsβ17May 21, 2025Updated 11 months ago
- implementation of https://arxiv.org/pdf/2312.09299β21Jul 3, 2024Updated last year
- Add automatic captions to short videos (YouTube Shorts, TikTok) using AI speech recognition. Fast, customizable, and easy to use.β14Feb 12, 2025Updated last year
- AI-powered text compression library for RAG systems and API calls. Reduce token usage by up to 50-60% while preserving semantic meaning wβ¦β85Aug 16, 2025Updated 8 months ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmarkβ22Aug 22, 2025Updated 8 months ago
- β20Oct 21, 2025Updated 6 months ago
- Local dual-layer memory for AI agents using a compressed index plus vector retrievalβ48Updated this week
- A lightweight selfhosted web file uploader using a gist backendβ19Feb 12, 2025Updated last year
- π¨ Add text overlays to segmented objects in your images using AI. Powered by Meta's SAM2 for segmentation, running entirely in your browβ¦β22Feb 15, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Example demonstrating how to use gpt-4o-mini for fine-tuningβ28Aug 30, 2024Updated last year
- interact with your robot in JS, inspired by LeRobotβ38Nov 14, 2025Updated 5 months ago
- a 2D region based quadtree in JSβ22Feb 11, 2018Updated 8 years ago
- Copy My Writing is a command-line tool for generating content based on your personal writing style.β11Oct 12, 2025Updated 6 months ago
- A Tool for Validating Conformance to the Functions Framework Contractβ22Apr 24, 2026Updated last week
- This is an interface that will offline convert anything pdf document you give it into an interview between two people discussing it.β17Dec 8, 2024Updated last year
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalizationβ41Mar 7, 2025Updated last year
- Java and Scala client libraries for Concordβ13Feb 15, 2017Updated 9 years ago
- My learnings (publicly) on RAG systemsβ14Jan 2, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A fork of the PEFT library, supporting Robust Adaptation (RoSA)β15Aug 16, 2024Updated last year
- Drift detection module for machine learning pipelines.β24Jun 21, 2023Updated 2 years ago
- adapt data to and from every formatβ28Apr 27, 2026Updated last week
- Generalized Method of Moments estimationβ14Mar 23, 2025Updated last year
- App promotion web site: Teach website built by React and add GSAP animation!β14Jan 13, 2023Updated 3 years ago
- R package implementing the grammar of temporal graphicsβ24Apr 15, 2026Updated 2 weeks ago
- Memgraph Platform is a multi-container application containing Memgraph+MAGE and Memgraph Lab.β24Dec 1, 2025Updated 5 months ago