opendatalab / MinerU
A high-quality, one-stop open-source data extraction tool that converts PDF to Markdown and JSON.
☆40,061Updated this week
Alternatives and similar repositories for MinerU
Users who are interested in MinerU are comparing it to the libraries listed below.
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆8,171Updated 6 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆13,234Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆26,616Updated this week
- 🍒 Cherry Studio is a desktop client that supports multiple LLM providers.☆30,342Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,733Updated 5 months ago
- OCR & Document Extraction using vision models☆11,574Updated 2 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,847Updated last week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆60,141Updated this week
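- Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in multi-language libraries.☆35,540Updated last month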
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix-level subtitle cut…☆13,937Updated 2 months ago
- 🔥 MaxKB is a powerful, easy-to-use open-source platform for building enterprise-grade agents.☆17,207Updated this week
- 🔥 Open-source no code web data extraction platform. Instantly turn any website into API or spreadsheet 🔥☆13,246Updated last week
- FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data process…☆25,174Updated this week
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)☆35,837Updated 3 weeks ago
- PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with layout fully preserved; supports services such as Google/DeepL/Ollama/OpenAI and provides CLI/GUI/MCP/Doc…☆25,819Updated last week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆46,696Updated this week
- Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily.☆7,705Updated 2 weeks ago
- SOTA Open Source TTS☆22,407Updated 2 weeks ago
- A generative speech model for daily dialogue.☆37,196Updated 2 weeks ago
- ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool, built on a lightweight algorithm for producing ID photos.☆18,481Updated last week
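- LLM API management and distribution system supporting mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot (Wenxin Yiyan), iFLYTEK Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan, with unified API adaptation; can be used for key …☆26,243Updated this week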
- A simple screen parsing tool towards pure vision based GUI agent☆22,722Updated 3 months ago
- 🚀🚀 "Large Model": train a small 26M-parameter GPT entirely from scratch in just 2 hours!☆23,177Updated 2 months ago
- Production-ready platform for agentic workflow development.☆107,400Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆26,729Updated 3 weeks ago
- Integrate the DeepSeek API into popular software☆33,226Updated 2 months ago
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆19,846Updated 3 weeks ago
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆43,098Updated this week
- Question and Answer based on Anything.☆13,407Updated 3 months ago
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆14,663Updated last week