NVIDIA / nv-ingest
NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
☆2,758Updated this week
Alternatives and similar repositories for nv-ingest
Users who are interested in nv-ingest are comparing it to the libraries listed below.
- A system for agentic LLM-powered data processing and ETL☆3,029Updated 2 weeks ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,903Updated last month
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,449Updated 2 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,349Updated 6 months ago
- Task-Aware Agent-driven Prompt Optimization Framework☆3,657Updated 3 weeks ago
- Fast State-of-the-Art Static Embeddings☆1,882Updated 3 weeks ago
- Build custom inference engines for models, agents, multi-modal systems, RAG, pipelines and more.☆3,681Updated this week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,673Updated 2 weeks ago
- Knowledge Agents and Management in the Cloud☆4,194Updated last week
- RAG that intelligently adapts to your use case, data, and queries☆3,575Updated last week
- A toolkit to create optimal Production-ready Retrieval Augmented Generation (RAG) setup for your data☆1,508Updated 5 months ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,921Updated this week
- ETL, Analytics, Versioning for Unstructured Data☆2,694Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,216Updated 8 months ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,588Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,854Updated last month
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,477Updated last week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,102Updated this week
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,284Updated last week
- A fast multimodal LLM for real-time voice☆4,243Updated 2 months ago
- Deploy your agentic workflows to production☆2,059Updated 2 months ago
- Document to Markdown OCR library with Llama 3.2 vision☆2,415Updated 9 months ago
- Empowering RAG with a memory-based data interface for all-purpose applications!☆2,159Updated last month
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model"☆1,417Updated 3 weeks ago
- Improved file parsing for LLMs☆3,123Updated 11 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,376Updated last year
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web☆2,320Updated 4 months ago
- This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.☆2,303Updated 8 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,269Updated 2 months ago
- A collection of notebooks/recipes showcasing use cases of open-source models with Together AI.☆1,070Updated last week