NVIDIA / nv-ingestLinks
NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
☆2,744Updated this week
Alternatives and similar repositories for nv-ingest
Users that are interested in nv-ingest are comparing it to the libraries listed below
Sorting:
- A system for agentic LLM-powered data processing and ETL☆2,832Updated this week
- The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.☆3,551Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,842Updated 3 weeks ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,654Updated 2 months ago
- Fast State-of-the-Art Static Embeddings☆1,838Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,400Updated 2 weeks ago
- RAG that intelligently adapts to your use case, data, and queries☆3,491Updated 2 months ago
- Knowledge Agents and Management in the Cloud☆4,139Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,260Updated 4 months ago
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,057Updated last week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,582Updated last month
- Deploy your agentic worfklows to production☆2,055Updated 2 weeks ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,146Updated 6 months ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,466Updated 3 months ago
- ☆2,020Updated 6 months ago
- Document to Markdown OCR library with Llama 3.2 vision☆2,388Updated 7 months ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆926Updated 7 months ago
- Desktop app for prototyping and debugging LangGraph applications locally.☆3,203Updated 2 months ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,764Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,371Updated 2 weeks ago
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,377Updated 3 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,864Updated last month
- 📄🧠 PageIndex: Document Index for Reasoning-based RAG☆2,516Updated last week
- High-performance retrieval engine for unstructured data☆1,495Updated last month
- ETL, Analytics, Versioning for Unstructured Data☆2,652Updated this week
- Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing, that…☆1,290Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,417Updated 3 months ago
- A fast multimodal LLM for real-time voice☆4,192Updated 2 weeks ago
- 🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide…☆1,387Updated last month
- The Open Source Memory Layer For Autonomous Agents☆2,460Updated 10 months ago