NVIDIA / nv-ingest
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.
☆2,658Updated last week
Alternatives and similar repositories for nv-ingest:
Users that are interested in nv-ingest are comparing it to the libraries listed below
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,143Updated this week
- Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer…☆3,059Updated this week
- Fast State-of-the-Art Static Embeddings☆1,589Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆3,222Updated last month
- Build Real-Time Knowledge Graphs for AI Agents☆8,122Updated this week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,542Updated last week
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆915Updated 3 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆900Updated last week
- Open source multi-modal RAG for building AI apps over private knowledge.☆2,047Updated this week
- A system for agentic LLM-powered data processing and ETL☆1,937Updated this week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations☆4,931Updated this week
- The python library for real-time communication☆3,824Updated 2 weeks ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆6,377Updated 2 months ago
- Deploy high-performance AI models and inference pipelines on FastAPI with built-in batching, streaming and more.☆3,091Updated this week
- ☆3,290Updated last month
- Knowledge Agents and Management in the Cloud☆3,934Updated last week
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web☆2,085Updated this week
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆1,795Updated this week
- Build effective agents using Model Context Protocol and simple workflow patterns☆4,247Updated last week
- Everything about the SmolLM2 and SmolVLM family of models☆2,273Updated last month
- AdalFlow: The library to build & auto-optimize LLM applications.☆2,971Updated last month
- A fast multimodal LLM for real-time voice☆3,896Updated 2 months ago
- Task-Aware Agent-driven Prompt Optimization Framework☆3,218Updated last month
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆4,818Updated last month
- NVIDIA AI Blueprint for multimodal PDF data extraction for enterprise RAG☆321Updated last month
- ACI.dev is the open source platform that connects your AI agents to 600+ tool integrations with multi-tenant auth, granular permissions, …☆1,973Updated this week
- Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.☆1,885Updated last month
- ☆3,114Updated 2 weeks ago
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,263Updated 2 months ago
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi…☆4,292Updated this week