QuivrHQ / MegaParse
File Parser optimised for LLM Ingestion with no loss. Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
★6,449 · Updated 3 months ago
Alternatives and similar repositories for MegaParse
Users who are interested in MegaParse are comparing it to the libraries listed below.
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents ★5,284 · Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training ★12,482 · Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework ★3,265 · Updated last week
- Build Real-Time Knowledge Graphs for AI Agents ★9,878 · Updated this week
- Open Source Alternative to NotebookLM / Perplexity / Glean, connected to external sources such as search engines (Tavily, Linkup), Slack,… ★4,603 · Updated last week
- A better UX for chat, writing content, and coding with LLMs. ★4,588 · Updated this week
- OCR & Document Extraction using vision models ★11,232 · Updated last week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API. ★6,904 · Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks ★6,558 · Updated this week
- Get your documents ready for gen AI ★30,684 · Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ★4,084 · Updated 3 months ago
- An open-source RAG-based tool for chatting with your documents. ★22,347 · Updated last month
- A Comprehensive Toolkit for High-Quality PDF Content Extraction ★7,744 · Updated 4 months ago
- Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi… ★4,491 · Updated this week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents… ★2,587 · Updated last month
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent… ★2,671 · Updated last week
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a… ★7,007 · Updated last week
- Agent Framework / shim to use Pydantic with LLMs ★9,748 · Updated this week
- A free and open source, self-hosted, AI-based live meeting note taker and minutes summary generator that can completely run in your Local … ★6,188 · Updated last week
- Knowledge Agents and Management in the Cloud ★3,984 · Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ★2,182 · Updated this week
- Open source multi-modal RAG for building AI apps over private knowledge. ★2,385 · Updated this week
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,… ★8,786 · Updated this week
- Open-source GenBI AI Agent that empowers data-driven teams to chat with their databases to generate Text-to-SQL, BI and embedded AI. … ★7,900 · Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ★23,128 · Updated this week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid… ★2,568 · Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ★8,766 · Updated 3 weeks ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster ★4,988 · Updated 2 weeks ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate ★7,133 · Updated 2 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. … ★8,128 · Updated last week