QuivrHQ / MegaParseLinks
File Parser optimised for LLM Ingestion with no loss π§ Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
β7,269Updated 11 months ago
Alternatives and similar repositories for MegaParse
Users that are interested in MegaParse are comparing it to the libraries listed below
Sorting:
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documentsβ¦β2,969Updated last month
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,932Updated 4 months ago
- The easiest way to use Agentic RAG in any enterpriseβ4,393Updated last year
- Knowledge Agents and Management in the Cloudβ4,229Updated last week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundryβ4,314Updated 2 months ago
- Task-Aware Agent-driven Prompt Optimization Frameworkβ3,747Updated 3 months ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documentsβ6,068Updated this week
- π A better UX for chat, writing content, and coding with LLMs.β5,321Updated 3 weeks ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.β1,480Updated 5 months ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraβ¦β2,817Updated last week
- A system for agentic LLM-powered data processing and ETLβ3,501Updated last week
- A Comprehensive Toolkit for High-Quality PDF Content Extractionβ9,143Updated last year
- Document to Markdown OCR library with Llama 3.2 visionβ2,422Updated last year
- β‘οΈ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-pβ¦β13,555Updated this week
- Structured data extraction and instruction calling with ML, LLM and Vision LLMβ5,101Updated last week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviateβ7,540Updated 6 months ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization β¨β2,743Updated this week
- An open-source RAG-based tool for chatting with your documents.β24,873Updated 6 months ago
- A visual playground for agentic workflows: Iterate over your agents 10x fasterβ5,662Updated 6 months ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidβ¦β2,731Updated 3 weeks ago
- Fully local web research and report writing assistantβ8,477Updated 5 months ago
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.β4,174Updated this week
- OCR & Document Extraction using vision modelsβ12,041Updated 8 months ago
- Improved file parsing for LLMβsβ3,151Updated last year
- πͺ Create rich visualizations with AIβ14,789Updated this week
- A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.β3,792Updated 3 weeks ago
- π₯ Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web witβ¦β6,278Updated 3 weeks ago
- Neo4j graph construction from unstructured data using LLMsβ4,306Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your dataβ1,522Updated 8 months ago
- Flexible and powerful framework for managing multiple AI agents and handling complex conversationsβ7,303Updated last week