NVIDIA / nv-ingest
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.
☆2,653 · Updated this week
Alternatives and similar repositories for nv-ingest:
Users who are interested in nv-ingest are comparing it to the libraries listed below:
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,124 · Updated this week
- A system for agentic LLM-powered data processing and ETL ☆1,767 · Updated this week
- RAG that intelligently adapts to your use case, data, and queries ☆3,164 · Updated 3 weeks ago
- Build Real-Time Knowledge Graphs for AI Agents ☆4,092 · Updated this week
- Knowledge Agents and Management in the Cloud ☆3,906 · Updated this week
- A fast multimodal LLM for real-time voice ☆3,855 · Updated 2 months ago
- ☆2,973 · Updated this week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid… ☆2,522 · Updated last week
- The python library for real-time communication ☆3,750 · Updated this week
- Fast State-of-the-Art Static Embeddings ☆1,359 · Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆895 · Updated 2 months ago
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite ☆923 · Updated this week
- SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web ☆1,548 · Updated this week
- ☆3,238 · Updated 3 weeks ago
- Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth! ☆3,763 · Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. ☆1,205 · Updated this week
- Fully local web research and report writing assistant ☆7,099 · Updated last month
- Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit ☆2,702 · Updated this week
- Things you can do with the token embeddings of an LLM ☆1,437 · Updated 3 weeks ago
- Deploy high-performance AI models and inference pipelines on FastAPI with built-in batching, streaming and more. ☆3,064 · Updated last week
- Build effective agents using Model Context Protocol and simple workflow patterns ☆4,029 · Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework ☆3,175 · Updated last month
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) int… ☆552 · Updated last month
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations ☆4,744 · Updated last week
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig… ☆915 · Updated 2 months ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents ☆5,116 · Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications. ☆2,930 · Updated last month
- Cost-efficient and pluggable Infrastructure components for GenAI inference ☆3,484 · Updated this week
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search. ☆682 · Updated this week
- Agent S: an open agentic framework that uses computers like a human ☆2,436 · Updated this week