QuivrHQ / MegaParseLinks
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
☆7,234Updated 9 months ago
Alternatives and similar repositories for MegaParse
Users that are interested in MegaParse are comparing it to the libraries listed below
Sorting:
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,963Updated this week
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,763Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,910Updated 2 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,285Updated 2 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,940Updated 2 months ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,596Updated 4 months ago
- The easiest way to use Agentic RAG in any enterprise☆4,362Updated 10 months ago
- A system for agentic LLM-powered data processing and ETL☆3,101Updated last week
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆5,052Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,486Updated 3 weeks ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,450Updated 4 months ago
- Knowledge Agents and Management in the Cloud☆4,205Updated this week
- OCR & Document Extraction using vision models☆11,968Updated 6 months ago
- 🪄 Create rich visualizations with AI☆14,316Updated last week
- Document to Markdown OCR library with Llama 3.2 vision☆2,420Updated 10 months ago
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your …☆4,571Updated this week
- 📃 A better UX for chat, writing content, and coding with LLMs.☆5,164Updated 3 months ago
- Task-Aware Agent-driven Prompt Optimization Framework☆3,693Updated last month
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,906Updated last month
- Fully local web research and report writing assistant☆8,342Updated 3 months ago
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit…☆5,995Updated this week
- An open-source RAG-based tool for chatting with your documents.☆24,676Updated 4 months ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…