NeMo Retriever extraction is a scalable, performance-oriented microservice for extracting document content and metadata. It uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts, and images for use in downstream generative applications.
☆2,851 · Feb 26, 2026 · Updated this week
Alternatives and similar repositories for nv-ingest
Users who are interested in nv-ingest compare it to the libraries listed below.
- A fast multimodal LLM for real-time voice ☆4,367 · Dec 12, 2025 · Updated 2 months ago
- Get your documents ready for gen AI ☆54,754 · Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆7,342 · Feb 21, 2025 · Updated last year
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆27,949 · Sep 30, 2025 · Updated 5 months ago
- Task-Aware Agent-driven Prompt Optimization Framework ☆3,805 · Oct 13, 2025 · Updated 4 months ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents ☆6,452 · Updated this week
- An open-source RAG-based tool for chatting with your documents. ☆25,168 · Updated this week
- OCR & Document Extraction using vision models ☆12,155 · May 20, 2025 · Updated 9 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,360 · Feb 24, 2026 · Updated last week
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a… ☆8,574 · Jan 28, 2026 · Updated last month
- 🪄 Create rich visualizations with AI ☆15,103 · Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training ☆16,947 · Feb 19, 2026 · Updated last week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights. ☆7,295 · Updated this week
- DSPy: The framework for programming—not prompting—language models ☆32,519 · Updated this week
- Composable building blocks to build LLM Apps ☆8,278 · Updated this week
- Build, run, manage agentic software at scale. ☆38,276 · Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy ☆32,069 · Updated this week
- ⚡️ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-p… ☆14,528 · Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean… ☆14,074 · Updated this week
- GenAI Agent Framework, the Pydantic way ☆15,120 · Updated this week
- Knowledge Agents and Management in the Cloud ☆4,235 · Feb 17, 2026 · Updated 2 weeks ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ☆21,026 · Mar 11, 2025 · Updated 11 months ago
- Build Real-Time Knowledge Graphs for AI Agents ☆23,192 · Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ☆37,083 · Updated this week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid… ☆2,738 · Jan 9, 2026 · Updated last month
- Universal memory layer for AI Agents ☆48,604 · Updated this week
- The AI Browser Automation Framework ☆21,261 · Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆61,332 · Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. ☆53,029 · Updated this week
- 🤗 smolagents: a barebones library for agents that think in code. ☆25,615 · Feb 21, 2026 · Updated last week
- Python tool for converting files and office documents to Markdown. ☆88,637 · Feb 20, 2026 · Updated last week
- Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. … ☆32,752 · Feb 24, 2026 · Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆31,162 · Updated this week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations ☆7,472 · Feb 11, 2026 · Updated 3 weeks ago
- Automate browser based workflows with AI ☆20,629 · Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,940 · Sep 24, 2025 · Updated 5 months ago
- A framework for building realtime voice AI agents 🤖🎙️📹 ☆9,441 · Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. … ☆11,472 · Feb 10, 2026 · Updated 3 weeks ago
- An autonomous agent that conducts deep research on any data using any LLM providers. ☆25,472 · Feb 21, 2026 · Updated last week