NVIDIA / nv-ingest
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.
☆2,631Updated this week
Alternatives and similar repositories for nv-ingest:
Users that are interested in nv-ingest are comparing it to the libraries listed below
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,090Updated this week
- Knowledge Agents and Management in the Cloud☆3,853Updated this week
- Build Real-Time Knowledge Graphs for AI Agents☆3,085Updated last week
- A system for agentic LLM-powered data processing and ETL☆1,728Updated last week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆4,160Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆3,074Updated last month
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆3,987Updated last month
- Agent Framework / shim to use Pydantic with LLMs☆8,016Updated this week
- Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.☆3,016Updated this week
- ☆2,685Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆5,917Updated last month
- Task-Aware Agent-driven Prompt Optimization Framework☆3,073Updated last week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,170Updated last week
- Improved file parsing for LLM’s☆2,888Updated 4 months ago
- Build effective agents using Model Context Protocol and simple workflow patterns☆2,233Updated last week
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆4,982Updated this week
- LOTUS: A semantic query engine for fast and easy LLM-powered data processing☆1,138Updated last week
- Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit☆2,579Updated last week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite☆876Updated 2 weeks ago
- A fast multimodal LLM for real-time voice☆3,791Updated last month
- Deploy your agentic worfklows to production☆1,984Updated last week
- The fast, Pythonic way to build Model Context Protocol servers 🚀☆3,557Updated last week
- The python library for real-time communication☆3,355Updated this week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,488Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆879Updated last month
- Document to Markdown OCR library with Llama 3.2 vision☆2,238Updated 2 months ago
- Prompt optimization scratch☆678Updated 3 weeks ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆911Updated 2 months ago
- The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)☆2,791Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,403Updated 2 months ago