opendatalab / MinerULinks
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
☆36,799Updated this week
Alternatives and similar repositories for MinerU
Users that are interested in MinerU are comparing it to the libraries listed below
Sorting:
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆7,982Updated 5 months ago
- Production-ready platform for agentic workflow development.☆104,441Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆40,551Updated this week
- OCR & Document Extraction using vision models☆11,468Updated last month
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,593Updated 2 weeks ago
- 🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.☆29,040Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,683Updated 4 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆26,105Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,709Updated this week
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆24,898Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆13,063Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆26,017Updated last week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆57,684Updated this week
- PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Doc…☆24,962Updated this week
- 💬 MaxKB is an open-source AI assistant for enterprise. It seamlessly integrates RAG pipelines, supports robust workflows, and provides M…☆16,909Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆53,115Updated this week
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆8,117Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆46,531Updated this week
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆14,452Updated this week
- 分享一些好用的 Dify DSL 工作流程,自用、学习两相宜。 Sharing some Dify workflows.☆8,014Updated 2 weeks ago
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆45,675Updated this week
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切…☆13,339Updated last month