QuivrHQ / MegaParseLinks
File Parser optimised for LLM Ingestion with no loss π§ Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
β7,094Updated 6 months ago
Alternatives and similar repositories for MegaParse
Users that are interested in MegaParse are comparing it to the libraries listed below
Sorting:
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documentsβ5,654Updated last week
- Knowledge Agents and Management in the Cloudβ4,111Updated this week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documentsβ¦β2,786Updated 2 weeks ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraβ¦β2,731Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.β7,196Updated this week
- Agent Framework / shim to use Pydantic with LLMsβ11,725Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,766Updated this week
- The easiest way to use Agentic RAG in any enterpriseβ4,298Updated 6 months ago
- π₯ Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wiβ¦β4,920Updated this week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversationsβ6,382Updated last month
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundryβ4,191Updated 6 months ago
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automationβ4,190Updated last month
- Automate browser-based workflows with LLMs and Computer Visionβ14,089Updated this week
- A system for agentic LLM-powered data processing and ETLβ2,713Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x fasterβ5,358Updated last month
- Structured data extraction and instruction calling with ML, LLM and Vision LLMβ4,954Updated last month
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simplβ¦β5,291Updated last week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviateβ7,247Updated last month
- OCR & Document Extraction using vision modelsβ11,772Updated 3 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.β1,369Updated 2 weeks ago
- Task-Aware Agent-driven Prompt Optimization Frameworkβ3,486Updated 2 weeks ago
- Improved file parsing for LLMβsβ3,044Updated 9 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ18,337Updated this week
- Full-stack framework for building Multi-Agent Systems with memory, knowledge and reasoning.β31,838Updated this week
- Get your documents ready for gen AIβ36,287Updated this week
- Desktop app for prototyping and debugging LangGraph applications locally.β3,149Updated last month
- LLM-powered multiagent persona simulation for imagination enhancement and business insights.β7,014Updated 2 weeks ago
- β‘οΈ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-pβ¦β9,867Updated this week
- Turn any webpage into structured data using LLMsβ5,953Updated 3 months ago
- A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQLβ5,129Updated last week