QuivrHQ / MegaParse
File Parser optimised for LLM Ingestion with no loss. Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
★7,131 · Updated 6 months ago
Alternatives and similar repositories for MegaParse
Users who are interested in MegaParse are comparing it to the libraries listed below.
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents ★5,746 · Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ★2,832 · Updated 3 weeks ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra… ★2,740 · Updated this week
- Create rich visualizations with AI ★13,638 · Updated this week
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit… ★5,043 · Updated last week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents… ★2,864 · Updated last month
- Task-Aware Agent-driven Prompt Optimization Framework ★3,571 · Updated last month
- Memory for AI Agents in 5 lines of code ★6,940 · Updated this week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights. ★7,039 · Updated 2 weeks ago
- A system for agentic LLM-powered data processing and ETL ★2,812 · Updated last week
- A better UX for chat, writing content, and coding with LLMs. ★4,950 · Updated 3 weeks ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. … ★10,586 · Updated last week
- Fully local web research and report writing assistant ★8,083 · Updated last month
- The python library for real-time communication ★4,269 · Updated last week
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨ ★2,457 · Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. ★1,400 · Updated 2 weeks ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster ★5,441 · Updated last month
- The most accurate document search and store for building AI apps ★3,164 · Updated last week
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simpl… ★5,340 · Updated last week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations ★6,483 · Updated 3 weeks ago
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API. ★7,267 · Updated 3 weeks ago
- Open Source Alternative to NotebookLM / Perplexity, connected to external sources such as Search Engines, Slack, Linear, Jira, ClickUp, C… ★7,632 · Updated last week
- Vision agent ★5,030 · Updated last week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ★4,205 · Updated last week
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your … ★4,227 · Updated this week
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget) ★4,832 · Updated 2 weeks ago
- Knowledge Agents and Management in the Cloud ★4,130 · Updated this week
- Turn any webpage into structured data using LLMs ★5,988 · Updated 3 months ago
- Document to Markdown OCR library with Llama 3.2 vision ★2,388 · Updated 7 months ago
- OCR & Document Extraction using vision models ★11,819 · Updated 3 months ago