NVIDIA / nv-ingest
NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
☆2,789Updated this week
Alternatives and similar repositories for nv-ingest
Users who are interested in nv-ingest are comparing it to the libraries listed below.
- Knowledge Agents and Management in the Cloud☆4,221Updated 2 weeks ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,924Updated 3 months ago
- A system for agentic LLM-powered data processing and ETL☆3,310Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆3,642Updated last month
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,716Updated last week
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,408Updated 7 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,464Updated 4 months ago
- A toolkit to create optimal Production-ready Retrieval Augmented Generation (RAG) setup for your data☆1,522Updated 7 months ago
- A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.☆3,746Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,246Updated 10 months ago
- Fast State-of-the-Art Static Embeddings☆1,959Updated last month
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,403Updated last week
- Deploy your agentic workflows to production☆2,065Updated 2 weeks ago
- Document to Markdown OCR library with Llama 3.2 vision☆2,414Updated 11 months ago
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,126Updated 2 weeks ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,698Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,308Updated last month
- High-performance retrieval engine for unstructured data☆1,545Updated last month
- Improved file parsing for LLMs☆3,146Updated last year
- Empowering RAG with a memory-based data interface for all-purpose applications!☆2,192Updated 3 months ago
- The easiest way to use Agentic RAG in any enterprise☆4,375Updated 11 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,803Updated 7 months ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆933Updated 10 months ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆6,019Updated this week
- AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, acc…☆1,510Updated 2 weeks ago
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,454Updated 7 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,578Updated last week
- The most accurate document search and store for building AI apps☆3,429Updated this week
- The Open Source Memory Layer For Autonomous Agents☆2,517Updated last year
- One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure☆2,397Updated 7 months ago