2dogsandanerd / smart-ingest-kitLinks
Stop using static chunk sizes. A lightweight, production-ready RAG ingestion toolkit. Uses Docling for layout-aware parsing and applies smart heuristics for optimal chunking (PDF vs Code vs MD). Extracted from a production RAG platform
☆60Updated 3 weeks ago
Alternatives and similar repositories for smart-ingest-kit
Users that are interested in smart-ingest-kit are comparing it to the libraries listed below
Sorting:
- Official python implementation of UTCP. UTCP is an open standard that lets AI agents call any API directly, without extra middleware.☆633Updated 2 weeks ago
- Declarative language for composable Al workflows. Devtool for agents and mere humans.☆598Updated this week
- Pixelagent — Multimodal stateful agents☆223Updated 6 months ago
- Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes☆700Updated this week
- Open-source AI-powered data science platform.☆304Updated 2 months ago
- Laddr is a python framework for building multi-agent systems where agents communicate, delegate tasks, and execute work in parallel. Thin…☆272Updated 3 weeks ago
- An Excel AI agent that uses MCP tools to let LLMs read, edit, and automate Excel spreadsheets.☆72Updated last week
- Git Based Memory Storage for Conversational AI Agent☆757Updated last month
- The Intelligence Layer for AI agents. Connect your models, tools, and data to create agentic apps that can think, act and talk to you.☆569Updated last week
- Build your personal memory system to power your AI apps.☆1,170Updated this week
- An open-source Text2SQL tool that transforms natural language into SQL using graph-powered schema understanding. Ask your database questi…☆276Updated last week
- Deploy an AI Analyst in less than 2 mins — connect any LLM to any data source with centralized context management, observability, and con…☆324Updated last week
- Interactive 3D visualization of knowledge graphs generated by Microsoft GraphRAG. Explore entities, relationships, and communities with i…☆449Updated 3 months ago
- Turn AI into a persistent, memory-powered collaborator. Universal MCP Server (supports HTTP & STDIO) enabling cross-platform AI memory, …☆202Updated 2 weeks ago
- The Supabase of AI era. A modular, open-source backend for building AI-native software — designed for knowledge, not static data.☆421Updated 6 months ago
- 🤖 An open-source AI assistant answering questions using your docs☆235Updated 3 weeks ago
- Orchestrate Claude Code, Codex, and Gemini sessions on a multiplayer canvas. Manage git worktrees, track AI conversations, and visualize …☆817Updated this week
- The memory-first coding agent☆642Updated this week
- Semi-Structured Agentic Framework. Workflows build themselves as agents discover what needs to be done, not what you predicted upfront.☆1,054Updated 3 weeks ago
- Frontend Repository for Elysia☆166Updated last week
- In the midst of all the tools out there that you can possibly use to keep track of them. Here's a "shovel" that just works to try them al…☆112Updated last week
- state of the art browsing agent (WebArena 72.7%)☆360Updated 2 months ago
- CoexistAI is a modular, developer-friendly research assistant framework . It enables you to build, search, summarize, and automate resear…☆384Updated 2 months ago
- Turn your data into shareable RAG apps in minutes. All in pure Markdown. Zero boilerplate.☆804Updated last week
- DeepContext is an MCP server that adds symbol-aware semantic search to Claude Code, Codex CLI, and other agents for faster, smarter conte…☆250Updated 3 months ago
- VeritasGraph: Enterprise-Grade Graph RAG for Secure, On-Premise AI with Verifiable Attribution☆180Updated last month
- awesome-rag: a collection of awesome thing related to Retrieval-Augmented Generation☆171Updated 5 months ago
- next-generation AI memory infrastructure (powered by mem0 and graphiti)☆163Updated last week
- Gemini-cli or claude code? Why not both? LangCode combines all CLI capabilities and models in one place ☂️!☆427Updated last month
- ☆210Updated last week