opendatalab/mineru-vl-utils

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/opendatalab/mineru-vl-utils)

opendatalab / mineru-vl-utils

A Python package for interacting with the MinerU Vision-Language Model.

☆136

Alternatives and similar repositories for mineru-vl-utils

Users that are interested in mineru-vl-utils are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

opendatalab / MinerU-HTML
View on GitHub
MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG application…
☆276Mar 27, 2026Updated 3 months ago
opendatalab / dsdl-docs
View on GitHub
Data Set Description Language Specification （新一代人工智能数据集描述语言DSDL）
☆46May 29, 2024Updated 2 years ago
opendatalab / UniMERNet
View on GitHub
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
☆492Sep 28, 2025Updated 9 months ago
opendatalab / labelU-Kit
View on GitHub
Data annotation component library --provided as NPM packages
☆157Updated this week
opendatalab / opendatalab-python-sdk
View on GitHub
SDK of OpenDataLab - https://opendatalab.org.cn
☆60Jul 31, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
opendatalab / Miner-PDF-Benchmark
View on GitHub
MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.
☆24Dec 11, 2024Updated last year
opendatalab / Meta-rater
View on GitHub
[ACL 2025 Best Theme Paper] This is the official implementation for the paper: "Meta-rater: A Multi-dimensional Data Selection Method for…
☆196Aug 29, 2025Updated 10 months ago
opendatalab / OmniDocBench
View on GitHub
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
☆1,898Jun 26, 2026Updated 3 weeks ago
opendatalab / MinerU-Diffusion
View on GitHub
[ECCV 2026] A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decodi…
☆623Jun 18, 2026Updated last month
yujunhuics / LayoutReader
View on GitHub
阅读顺序、Layoutreader
☆18May 8, 2025Updated last year
Niujunbo2002 / NativeRes-LLaVA
View on GitHub
Official code repo for our work "Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models"
☆54Jun 17, 2025Updated last year
opendatalab / DocLayout-YOLO
View on GitHub
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,232Apr 14, 2025Updated last year
Shannon4Science / NanaDraw
View on GitHub
NanaDraw turns complex scientific ideas into clear, expressive visuals you can use right away. Powered by Nano Banana, it generates edita…
☆111Apr 29, 2026Updated 2 months ago
opendatalab / MinerU-Popo
View on GitHub
☆257Jun 15, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
opendatalab / OHR-Bench
View on GitHub
(ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
☆104Dec 3, 2025Updated 7 months ago
Tencent / POINTS-Reader
View on GitHub
☆197Dec 7, 2025Updated 7 months ago
FreeOCR-AI / layoutreader
View on GitHub
A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.
☆322Aug 15, 2025Updated 11 months ago
RapidAI / RapidTable
View on GitHub
基于序列表格识别算法推理库，集成PP-Structure和modelscope等表格识别算法。
☆432Apr 23, 2026Updated 2 months ago
TencentCloudADP / youtu-parsing
View on GitHub
Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding
☆69Jun 15, 2026Updated last month
OpenDCAI / Flash-MinerU
View on GitHub
Ray-powered accelerator for MinerU, turning PDF → Markdown into a scalable, cluster-ready data infrastructure. 基于 Ray 的 MinerU 加速层，将 PDF …
☆63Apr 20, 2026Updated 3 months ago
Tencent-Hunyuan / HunyuanOCR
View on GitHub
☆1,861Updated this week
DocTron-hub / FD-RL
View on GitHub
[CVPR 2026] Reading or Reasoning? Format Decoupled Reinforcement Learning for Document OCR
☆18Mar 23, 2026Updated 3 months ago
poloclub / unitable
View on GitHub
UniTable: Towards a Unified Table Foundation Model
☆533Apr 21, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
dada-qin / Data-Centric_LLM_Studies
View on GitHub
A list of papers about data quality in Large Language Models (LLMs)
☆27Dec 14, 2023Updated 2 years ago
ZichenWen1 / AHGFC
View on GitHub
The source code for “Homophily-Related: Adaptive Hybrid Graph Filter for Multi-View Graph Clustering”
☆11Apr 10, 2024Updated 2 years ago
opendatalab / LabelLLM
View on GitHub
The Open-Source Data Annotation Platform
☆1,262Jul 2, 2026Updated 2 weeks ago
xrose3159 / PaperPub
View on GitHub
PaperPub is an academic arena where diverse AI Agents read papers daily, pick apart each other's arguments, and fiercely debate.
☆43Jun 12, 2026Updated last month
LivingSkyTechnologies / Dense_Article_Dataset_DAD
View on GitHub
Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis
☆16Jan 13, 2022Updated 4 years ago
RapidAI / TableStructureRec
View on GitHub
整理目前开源的最优表格识别模型，完善前后处理，模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-…
☆954Aug 3, 2025Updated 11 months ago
OpenDCAI / AgentFlow
View on GitHub
The First Unified Agent Data Synthesis Framework for Custom Agentic Task with all-in-one envrionment
☆124May 4, 2026Updated 2 months ago
studio-dots-ai / dots.ocr
View on GitHub
Multilingual Document Layout Parsing in a Single Vision-Language Model
☆9,016Mar 24, 2026Updated 3 months ago
ZichenWen1 / DIJA
View on GitHub
(ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"
☆79Feb 9, 2026Updated 5 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
opendatalab / PDF-Extract-Kit
View on GitHub
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,796Jan 3, 2025Updated last year
AlibabaResearch / AdvancedLiterateMachinery
View on GitHub
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…
☆1,833Mar 17, 2026Updated 4 months ago
opendatalab / VIGC
View on GitHub
AAAI 2024: Visual Instruction Generation and Correction
☆97Feb 4, 2024Updated 2 years ago
SilverLucFox / Loula_SqLite_Viewer
View on GitHub
☆18Sep 17, 2025Updated 10 months ago
magicpdf / Magic-Doc
View on GitHub
conversion doc（pdf/html/doc/docx/ppt/pptx）to markdown
☆49Jul 23, 2024Updated last year
alibaba / Logics-Parsing
View on GitHub
☆1,393May 13, 2026Updated 2 months ago
opendatalab / CLIP-Parrot-Bias
View on GitHub
ECCV2024_Parrot Captions Teach CLIP to Spot Text
☆66Sep 6, 2024Updated last year