Open-source toolkit for reliable RAG pipelines: convert PDFs to Markdown, clean documents, inspect chunks, compare chunking strategies, and enrich metadata for LLM applications.
☆126 · Jun 6, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for chunky
Users that are interested in chunky are comparing it to the libraries listed below.
- Open Knowledge Graph Resources is a static, daily-refreshed catalog of ontology and semantic software records sourced from Wikidata. It p… ☆64 · Updated this week
- A local proxy that strips web pages down to clean text before they enter your AI agent's context window. 704K tokens → 2.6K tokens. No LL… ☆67 · Apr 5, 2026 · Updated 2 months ago
- ☆21 · May 29, 2025 · Updated last year
- An opinionated list of practical tools for Conceptual Modeling and Linked Data ☆40 · Mar 24, 2026 · Updated 3 months ago
- The exhaustive guide to mastering Claude for Product Managers. Build your AI-native PM OS from scratch: PRDs, research synthesis, stakeh… ☆90 · Apr 27, 2026 · Updated 2 months ago
- Supercharge your workflow automation with this curated collection of n8n templates! Instantly connect your favorite apps like Gmail, Tele… ☆12 · May 22, 2025 · Updated last year
- A Reddit thread summarizer is a tool that generates a summary of the main points or themes discussed in a Reddit thread ☆18 · Jan 5, 2023 · Updated 3 years ago
- ☆12 · Nov 24, 2023 · Updated 2 years ago
- Dynamically constructs and adapts an LLM-generated taxonomy to a given corpus across multiple dimensions. ☆43 · Sep 27, 2025 · Updated 9 months ago
- This package enables inference of header hierarchy in the docling PDF parsing pipeline. ☆76 · Apr 24, 2026 · Updated 2 months ago
- A Toolbox Platform for Creating Your Own Tools. Bake Them with Code or AI. ☆25 · Jun 3, 2026 · Updated 3 weeks ago
- Security-hardened openclaw with auth gateway, AES-256 encryption and session management ☆56 · Jun 11, 2026 · Updated 2 weeks ago
- PDF extraction that checks its own work. #2 reading order accuracy: zero AI, zero GPU, zero cost. ☆71 · Updated this week
- Use the OpenAI API inside Supabase Edge Functions ☆16 · Dec 31, 2022 · Updated 3 years ago
- A Library and Tool for Symbolic Melodic Similarity based on Shape Similarity ☆29 · Nov 16, 2016 · Updated 9 years ago
- ☆19 · Nov 7, 2025 · Updated 7 months ago
- A curated list of awesome exploration policy papers. ☆14 · Jan 3, 2026 · Updated 5 months ago
- Debuggable runtime for AI agent workflows. DAG pipelines, artifact lineage, and replayable runs. ☆62 · Apr 19, 2026 · Updated 2 months ago
- AI guardian for manual crypto traders: risk monitoring, strategy validation & emotional trading detection. No trade execution. ☆110 · Mar 14, 2026 · Updated 3 months ago
- Markdown-Native Multi-Agent Task Coordination ☆27 · Feb 20, 2026 · Updated 4 months ago
- The AI cron daemon. Schedule recurring agent sessions via HEARTBEAT.md prompt files. ☆29 · Feb 10, 2026 · Updated 4 months ago
- A Simple GUI wrapper around yt-dlp for Windows using AHK ☆21 · Dec 12, 2021 · Updated 4 years ago
- Self-hosted personal AI agent and employee for workflow automation in your DMs. It writes code, runs tools, schedules jobs, saves workflo… ☆38 · Jun 17, 2026 · Updated last week
- Open-source framework for turning expert knowledge into PII-free synthetic conversational data and production-ready LoRA adapters. ☆10 · Mar 24, 2026 · Updated 3 months ago
- DPG Campus Tool. Shrink massive PDFs to fit AI upload limits. Sanitize before uploading to reduce risk of exposing sensitive data. ☆50 · Jan 20, 2026 · Updated 5 months ago
- Observability platform for Digital Employee, providing real-time tracing, session insights, and cost analysis for multi-agent workflows ☆136 · May 20, 2026 · Updated last month
- A fast, ai-native macOS Calendar CLI built on go-eventkit ☆63 · Jun 21, 2026 · Updated last week
- Context-efficient command runner for coding agents. ☆25 · Mar 9, 2026 · Updated 3 months ago
- Implementation of Visual Odometry for localization and Octomap for mapping ☆13 · Feb 6, 2020 · Updated 6 years ago
- A deployment framework for reinforcement learning-based motion control of legged robots, covering bipedal, quadrupedal, wheeled-bipedal, … ☆109 · Jun 19, 2026 · Updated last week
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis ☆28 · Feb 14, 2026 · Updated 4 months ago
- Avoid merge conflicts across git worktrees for parallel AI coding agents ☆62 · Feb 24, 2026 · Updated 4 months ago
- Open-source AI coding assistant CLI powered by DeepSeek-V3, a low-cost alternative to proprietary coding agents ☆19 · Jun 10, 2026 · Updated 2 weeks ago
- See what's on your ports, then act on it. Diagnostic-first port viewer for Linux, MacOS and Windows. ☆53 · Jun 8, 2026 · Updated 2 weeks ago
- 🤖 Kubernetes for AI Agents. Self-hosted, production-grade runtime for orchestrating LLM swarms and autonomous agents. TypeScript-native. ☆37 · Updated this week
- Push-to-talk voice typing for your terminal. Local Whisper, cross-platform. ☆54 · Mar 16, 2026 · Updated 3 months ago
- git worktrees that actually work (zero-config dep sync, fleet mode for parallel agents) ☆66 · Jun 16, 2026 · Updated last week
- The largest open corpus of classified docx documents ☆60 · May 23, 2026 · Updated last month
- Hold-to-talk voice input for Pi CLI: Deepgram streaming STT with live transcription, voice commands, and cross-platform hold detection ☆66 · May 1, 2026 · Updated last month