QuivrHQ / MegaParseLinks
File Parser optimised for LLM Ingestion with no loss π§ Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
β7,221Updated 8 months ago
Alternatives and similar repositories for MegaParse
Users that are interested in MegaParse are comparing it to the libraries listed below
Sorting:
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documentsβ5,933Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,904Updated last month
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documentsβ¦β2,926Updated last month
- Turn any webpage into structured data using LLMsβ6,083Updated 3 weeks ago
- A system for agentic LLM-powered data processing and ETLβ3,034Updated this week
- π A better UX for chat, writing content, and coding with LLMs.β5,135Updated 2 months ago
- Task-Aware Agent-driven Prompt Optimization Frameworkβ3,668Updated 3 weeks ago
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ18,857Updated 3 weeks ago
- An open-source RAG-based tool for chatting with your documents.β24,597Updated 4 months ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraβ¦β2,760Updated this week
- π₯ Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web witβ¦β5,964Updated this week
- The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs.β4,489Updated 2 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundryβ4,277Updated 2 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.β1,455Updated 2 months ago
- Knowledge Agents and Management in the Cloudβ4,198Updated last week
- Memory for AI Agents in 6 lines of codeβ8,160Updated this week
- The most accurate document search and store for building AI appsβ3,363Updated this week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversationsβ7,029Updated 3 weeks ago
- RAG that intelligently adapts to your use case, data, and queriesβ3,575Updated last week
- The easiest way to use Agentic RAG in any enterpriseβ4,353Updated 9 months ago
- Structured data extraction and instruction calling with ML, LLM and Vision LLMβ5,031Updated last week
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.β4,055Updated last week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights.β7,108Updated 2 months ago
- OCR & Document Extraction using vision modelsβ11,925Updated 5 months ago
- The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.β3,340Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/β9,377Updated 6 months ago
- CrawleeβA web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dowβ¦β7,113Updated last week
- A powerful framework for building realtime voice AI agents π€ποΈπΉβ8,152Updated this week
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automationβ4,388Updated last week
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simplβ¦β5,468Updated last week