NVIDIA / nv-ingest
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.
☆2,629Updated this week
Alternatives and similar repositories for nv-ingest:
Users that are interested in nv-ingest are comparing it to the libraries listed below
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,059Updated this week
- Build Real-Time Knowledge Graphs for AI Agents☆3,085Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆3,074Updated last month
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆5,917Updated last month
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,474Updated last week
- A powerful framework for building realtime voice AI agents 🤖🎙️📹☆5,380Updated this week
- A system for agentic LLM-powered data processing and ETL☆1,723Updated this week
- Knowledge Agents and Management in the Cloud☆3,827Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,073Updated last week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,509Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite☆876Updated 2 weeks ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆4,982Updated this week
- ☆2,574Updated last week
- Local realtime voice AI☆2,273Updated 3 weeks ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆4,160Updated this week
- A fast multimodal LLM for real-time voice☆3,771Updated last month
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆879Updated last month
- A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility.☆914Updated this week
- Everything about the SmolLM2 and SmolVLM family of models☆2,069Updated this week
- Build effective agents using Model Context Protocol and simple workflow patterns☆2,233Updated last week
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆911Updated 2 months ago
- Detect and extract tables to markdown and csv☆734Updated 2 months ago
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation☆3,752Updated 3 weeks ago
- Document to Markdown OCR library with Llama 3.2 vision☆2,238Updated 2 months ago
- This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.☆1,825Updated last month
- Neo4j graph construction from unstructured data using LLMs☆3,207Updated this week
- The python library for real-time communication☆3,355Updated this week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations☆4,593Updated this week
- The Open Source Memory Layer For Autonomous Agents☆2,112Updated 5 months ago
- The fast, Pythonic way to build Model Context Protocol servers 🚀☆1,993Updated last week