NVIDIA / nv-ingestLinks
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.
☆2,701Updated this week
Alternatives and similar repositories for nv-ingest
Users that are interested in nv-ingest are comparing it to the libraries listed below
Sorting:
- A system for agentic LLM-powered data processing and ETL☆2,340Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,269Updated last week
- The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.☆3,382Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,147Updated 2 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,298Updated last month
- Knowledge Agents and Management in the Cloud☆4,046Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,033Updated 3 weeks ago
- RAG that intelligently adapts to your use case, data, and queries☆3,372Updated 3 weeks ago
- Fast State-of-the-Art Static Embeddings☆1,752Updated last month
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,608Updated this week
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,186Updated this week
- ☆1,582Updated 3 months ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,009Updated this week
- Document to Markdown OCR library with Llama 3.2 vision☆2,360Updated 5 months ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,441Updated last month
- Task-Aware Agent-driven Prompt Optimization Framework☆3,383Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,137Updated 4 months ago
- Open source multi-modal RAG for building AI apps over private knowledge.☆2,784Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆6,560Updated 4 months ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆923Updated 5 months ago
- 📄 🧠 PageIndex: Document Index System for Reasoning-based RAG☆1,092Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,207Updated this week
- Improved file parsing for LLM’s☆3,013Updated 8 months ago
- LOTUS: A semantic query engine for fast and easy LLM-powered data processing☆1,240Updated this week
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,332Updated last month
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,394Updated this week
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,445Updated this week
- High-performance retrieval engine for unstructured data☆1,439Updated 3 weeks ago
- Implementing the 4 agentic patterns from scratch☆1,413Updated 3 months ago
- Deploy your agentic worfklows to production☆2,035Updated last week