NVIDIA / nv-ingestLinks
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.
☆2,685Updated this week
Alternatives and similar repositories for nv-ingest
Users that are interested in nv-ingest are comparing it to the libraries listed below
Sorting:
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,224Updated this week
- A system for agentic LLM-powered data processing and ETL☆2,273Updated last week
- RAG that intelligently adapts to your use case, data, and queries☆3,320Updated 2 months ago
- ContextGem: Effortless LLM extraction from documents☆1,180Updated this week
- Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer…☆3,428Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,432Updated last month
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,064Updated this week
- Open source multi-modal RAG for building AI apps over private knowledge.☆2,662Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,331Updated 2 weeks ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,586Updated 3 weeks ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆6,517Updated 4 months ago
- Fast State-of-the-Art Static Embeddings☆1,740Updated 2 weeks ago
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations☆6,041Updated 2 weeks ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,136Updated last month
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,101Updated 4 months ago
- Knowledge Agents and Management in the Cloud☆4,014Updated this week
- 📄 🧠 PageIndex: Document Index System for Reasoning-based RAG☆1,056Updated last week
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆921Updated 4 months ago
- DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into…☆1,332Updated last week
- A fast multimodal LLM for real-time voice☆4,016Updated 4 months ago
- Desktop app for prototyping and debugging LangGraph applications locally.☆2,952Updated this week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,614Updated last month
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,281Updated last week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,015Updated last week
- LOTUS: A semantic query engine for fast and easy LLM-powered data processing☆1,199Updated 3 weeks ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆1,964Updated this week
- This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.☆2,053Updated 4 months ago
- Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit☆3,161Updated this week
- Deploy your agentic worfklows to production☆2,026Updated this week
- ETL, Analytics, Versioning for Unstructured Data☆2,584Updated this week