A Python package for interacting with the MinerU Vision-Language Model.
☆131Jun 11, 2026Updated 2 weeks ago
Alternatives and similar repositories for mineru-vl-utils
Users that are interested in mineru-vl-utils are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.☆24Dec 11, 2024Updated last year
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆486Sep 28, 2025Updated 9 months ago
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆14Apr 18, 2024Updated 2 years ago
- PaperPub is an academic arena where diverse AI Agents read papers daily, pick apart each other's arguments, and fiercely debate.☆43Jun 12, 2026Updated 2 weeks ago
- 阅读顺序、Layoutreader☆18May 8, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- conversion doc(pdf/html/doc/docx/ppt/pptx)to markdown☆49Jul 23, 2024Updated last year
- Large-Scale High-quality Chinese Web Text with Multi-dimensional and fine-grained information☆40Dec 2, 2024Updated last year
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆277Dec 6, 2025Updated 6 months ago
- 秘塔AI搜索 Python SDK https://metaso.cn☆16Apr 21, 2025Updated last year
- Data Efficacy for Language Model Training☆50May 29, 2026Updated last month
- ☆121Jan 15, 2026Updated 5 months ago
- ☆199Dec 7, 2025Updated 6 months ago
- 记录有用的Git repos☆12Jul 28, 2024Updated last year
- Diffusion Model Improvement Method☆35Sep 4, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of our paper "Global Localization in Large-scale Point Clouds via Roll-pitch-yaw Invariant Place Recognition and Low-overl…☆10Nov 25, 2023Updated 2 years ago
- A mesh system for adapting multiple large language models.☆11Mar 20, 2024Updated 2 years ago
- An Annotated Question Answering Dataset for Assisting Chinese Python Programming Learners☆10Feb 23, 2024Updated 2 years ago
- Preview markdown files in yazi with mdcat☆13Apr 24, 2025Updated last year
- Implementation of research paper "Deep Splitting and Merging for Table Structure Decomposition"☆61Nov 9, 2022Updated 3 years ago
- Simple MCP Client for remote MCP Servers 🌐☆25Jun 15, 2025Updated last year
- A helper package to get information of scholarly articles from DBLP using its public API☆16May 13, 2025Updated last year
- An automated data pipeline scaling RL to pretraining levels☆77Jun 2, 2026Updated 3 weeks ago
- BERT&RoBERTa预训练代码,tensorflow和torch两种版本实现☆13Feb 8, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository contains the code for the Transformer-Representation Neural Topic Model (TNTM) based on the paper "Probabilistic Topic Mo…☆12Jul 6, 2024Updated last year
- ☆17Jan 31, 2025Updated last year
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆321Aug 15, 2025Updated 10 months ago
- ☆16Apr 30, 2025Updated last year
- Light C++11 graph library☆13Sep 16, 2021Updated 4 years ago
- 用于生成文本纠错模型(如Gector)需要的大量数据。☆14Jan 5, 2023Updated 3 years ago
- 同花顺算法挑战平台:【9-10双月赛】跨领域迁移的文本语义匹配☆11Oct 28, 2021Updated 4 years ago
- a fast async pool based on channel☆26Apr 22, 2026Updated 2 months ago
- 中文关键词提取☆14Aug 7, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆25Nov 7, 2022Updated 3 years ago
- Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding☆69Jun 15, 2026Updated 2 weeks ago
- datasets resource☆146May 27, 2026Updated last month
- [NIPS 2023] AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation☆12May 19, 2023Updated 3 years ago
- 🚀 快速部署你的专属开发环境 - 只需5分钟!☆15Nov 26, 2023Updated 2 years ago
- CycleCenternet based on MMDetection☆22Jun 28, 2023Updated 3 years ago
- Official Repository for ICML 2024 Paper "OT-CLIP: Understanding and Generalizing CLIP via Optimal Transport"☆23Dec 4, 2025Updated 6 months ago