NVIDIA / nv-ingest
NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
☆2,745Updated this week
Alternatives and similar repositories for nv-ingest
Users who are interested in nv-ingest are comparing it to the libraries listed below
- A system for agentic LLM-powered data processing and ETL☆2,945Updated 2 weeks ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,878Updated 2 weeks ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,430Updated last month
- RAG that intelligently adapts to your use case, data, and queries☆3,535Updated 3 months ago
- The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.☆3,578Updated 3 weeks ago
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,263Updated 5 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,192Updated 7 months ago
- A toolkit to create an optimal production-ready Retrieval Augmented Generation (RAG) setup for your data☆1,506Updated 4 months ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,663Updated 2 months ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,788Updated last week
- Fast State-of-the-Art Static Embeddings☆1,853Updated this week
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,835Updated this week
- Knowledge Agents and Management in the Cloud☆4,163Updated this week
- The most accurate document search and store for building AI apps☆3,291Updated this week
- Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit☆3,678Updated last week
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,534Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,623Updated last week
- A fast multimodal LLM for real-time voice☆4,211Updated last month
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,240Updated this week
- High-performance retrieval engine for unstructured data☆1,501Updated 2 months ago
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,391Updated 4 months ago
- Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing, that…☆1,311Updated this week
- This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.☆2,270Updated 7 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,259Updated last month
- 🦛 CHONK docs with Chonkie ✨ — The no-nonsense RAG library☆2,490Updated this week
- Document to Markdown OCR library with Llama 3.2 vision☆2,408Updated 8 months ago
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model"☆1,406Updated last month
- Deploy your agentic workflows to production☆2,052Updated last month
- The smart edge and AI gateway for agents. Arch is a high-performance proxy server that handles the low-level work in building agents: lik…☆3,744Updated this week
- Improved file parsing for LLMs☆3,105Updated 10 months ago