opendatalab / MinerU
A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.
☆28,808Updated this week
Alternatives and similar repositories for MinerU:
Users who are interested in MinerU are comparing it to the libraries listed below:
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆7,120Updated 2 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆10,379Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆45,800Updated this week
- 🍒 Cherry Studio is a desktop client that supports multiple LLM providers. Supports deepseek-r1☆20,319Updated this week
- OCR & Document Extraction using vision models☆10,605Updated this week
- PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved; supports Google/DeepL/Ollama/OpenAI and other services; provides CLI/GUI/Docker/…☆19,129Updated this week
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix-level subtitle cut…☆11,965Updated last week
- 🚀🚀 "Large model": train a small 26M-parameter GPT entirely from scratch in just 2h! 🌏☆16,670Updated last month
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,405Updated 4 months ago
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆31,990Updated this week
- Question and Answer based on Anything.☆12,885Updated last week
- 💬 Ready-to-use & flexible RAG Chatbot, supporting mainstream large language models (LLMs) such as DeepSeek-R1, Llama 3.3, Qwen2, OpenAI …☆14,889Updated this week
- Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, m…☆84,096Updated this week
- SOTA Open Source TTS☆20,165Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,359Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,257Updated last month
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/mEkkMXFG☆33,905Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆23,161Updated this week
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆19,039Updated 3 weeks ago
- Integrate the DeepSeek API into popular software☆29,794Updated this week
- FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data process…☆22,980Updated this week
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)☆33,535Updated this week
- Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆35,466Updated this week
- A simple screen parsing tool for pure vision-based GUI agents☆20,951Updated last week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆14,067Updated this week
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆16,285Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆16,929Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆23,520Updated 2 months ago
- Open-source no-code web data extraction platform. Turn websites to APIs & spreadsheets with no-code robots in minutes.☆9,648Updated this week
- 🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / DeepSe…☆57,990Updated this week