QuivrHQ / MegaParseLinks
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
☆6,560Updated 4 months ago
Alternatives and similar repositories for MegaParse
Users that are interested in MegaParse are comparing it to the libraries listed below
Sorting:
- Task-Aware Agent-driven Prompt Optimization Framework☆3,371Updated last month
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,428Updated this week
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,701Updated this week
- Knowledge Agents and Management in the Cloud☆4,046Updated this week
- Build Real-Time Knowledge Graphs for AI Agents☆12,130Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆13,196Updated this week
- Open Source Alternative to NotebookLM / Perplexity / Glean, connected to external sources such as search engines (Tavily, Linkup), Slack,…☆5,883Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,266Updated this week
- OCR & Document Extraction using vision models☆11,514Updated last month
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,033Updated last week
- Document to Markdown OCR library with Llama 3.2 vision☆2,360Updated 5 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,269Updated last week
- A system for agentic LLM-powered data processing and ETL☆2,340Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,137Updated 4 months ago
- Turn any webpage into structured data using LLMs☆5,052Updated last month
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations☆6,116Updated 2 weeks ago
- ☆4,279Updated this week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,625Updated 2 months ago
- LLM-powered multiagent persona simulation for imagination enhancement and business insights.☆6,922Updated 3 months ago
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your …☆3,519Updated last week
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation☆4,080Updated last week
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simpl…☆5,027Updated this week
- A powerful framework for building realtime voice AI agents 🤖🎙️📹☆6,687Updated this week
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi…☆4,700Updated last week
- An open-source RAG-based tool for chatting with your documents.☆22,756Updated last week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,608Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,298Updated last month
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆11,873Updated this week
- Neo4j graph construction from unstructured data using LLMs☆3,679Updated last week
- ⚡️Wren AI is your GenBI Agent, that you can query any database with natural language, get accurate SQL(Text-to-SQL), charts(Text-to-Chart…☆8,564Updated this week