A Python package for interacting with the MinerU Vision-Language Model.
☆120May 19, 2026Updated this week
Alternatives and similar repositories for mineru-vl-utils
Users who are interested in mineru-vl-utils are comparing it to the libraries listed below.
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆24Dec 11, 2024Updated last year
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆473Sep 28, 2025Updated 7 months ago
- Converts documents (pdf/html/doc/docx/ppt/pptx) to Markdown☆49Jul 23, 2024Updated last year
- Data annotation component library --provided as NPM packages☆152Apr 21, 2026Updated 3 weeks ago
- Large-Scale High-quality Chinese Web Text with Multi-dimensional and fine-grained information☆38Dec 2, 2024Updated last year
- SDK of OpenDataLab - https://opendatalab.org.cn☆60Jul 31, 2025Updated 9 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆277Dec 6, 2025Updated 5 months ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆31Dec 8, 2022Updated 3 years ago
- Data browser based on S3: a visualization tool for data (json / jsonl / parquet / html / md, etc.) stored in S3. 👇 Try online.☆85Apr 14, 2026Updated last month
- Metaso AI Search (秘塔AI搜索) Python SDK https://metaso.cn☆15Apr 21, 2025Updated last year
- ☆121Jan 15, 2026Updated 4 months ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- A record of useful Git repos☆12Jul 28, 2024Updated last year
- A mesh system for adapting multiple large language models.☆11Mar 20, 2024Updated 2 years ago
- An Annotated Question Answering Dataset for Assisting Chinese Python Programming Learners☆10Feb 23, 2024Updated 2 years ago
- Implementation (in progress) of Dieng et al.'s TopicRNN intended to be used as a baseline and starting point.☆10Jun 26, 2018Updated 7 years ago
- MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG application…☆242Mar 27, 2026Updated last month
- ☆12May 15, 2024Updated 2 years ago
- Used to store and organize public papers.☆18Feb 26, 2021Updated 5 years ago
- O'Reilly Course, In-Memory Computing Essentials☆10Oct 16, 2020Updated 5 years ago
- VITS speech synthesis, fine-tuned on Chinese only☆12Mar 15, 2023Updated 3 years ago
- BERT and RoBERTa pre-training code, implemented in both tensorflow and torch versions☆13Feb 8, 2023Updated 3 years ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆320Aug 15, 2025Updated 9 months ago
- ☆16Apr 30, 2025Updated last year
- Generates the large amounts of data needed by text error-correction models (such as Gector).☆14Jan 5, 2023Updated 3 years ago
- Tensor Belief Propagation - algorithm for approximate inference in discrete graphical models☆12Feb 17, 2020Updated 6 years ago
- Chinese keyword extraction☆14Aug 7, 2023Updated 2 years ago
- ☆17Jul 10, 2022Updated 3 years ago
- Blog code after adding a preview feature☆11Mar 23, 2017Updated 9 years ago
- Official implementation of ECCV 2024 paper: "Event-based Mosaicing Bundle Adjustment"☆13Mar 12, 2025Updated last year
- All Digital Phase-Locked Loop☆13May 22, 2023Updated 2 years ago
- ☆25Nov 7, 2022Updated 3 years ago
- Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding☆69Feb 10, 2026Updated 3 months ago
- datasets resource☆140Apr 14, 2026Updated last month
- CycleCenternet based on MMDetection☆22Jun 28, 2023Updated 2 years ago
- Official Repository for ICML 2024 Paper "OT-CLIP: Understanding and Generalizing CLIP via Optimal Transport"☆23Dec 4, 2025Updated 5 months ago
- ☆18May 28, 2024Updated last year
- NPUEval is an LLM evaluation dataset written specifically to target AIE kernel code generation on RyzenAI hardware.☆31Nov 8, 2025Updated 6 months ago
- Python 3 support for the MS COCO caption evaluation tools☆14Jun 14, 2024Updated last year