A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.
☆159 · Mar 27, 2026 · Updated this week
Alternatives and similar repositories for MinerU-Diffusion
Users that are interested in MinerU-Diffusion are comparing it to the libraries listed below.
- update ☆28 · Jan 29, 2026 · Updated last month
- 🛡️AutoControl Arena: Synthesizing Executable Test Environments for Frontier AI Risk Evaluation ☆39 · Mar 19, 2026 · Updated last week
- A hybrid deep learning framework for toxicity prediction ☆16 · Jan 14, 2026 · Updated 2 months ago
- Daily Chinese tech digest from Karpathy’s 90 curated blogs, with AI ranking, link analysis, and a static web reader. | Based on Karpathy's curated 90 … ☆38 · Feb 19, 2026 · Updated last month
- MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios. ☆24 · Dec 11, 2024 · Updated last year
- ☆33 · Dec 1, 2025 · Updated 3 months ago
- ☆41 · Mar 6, 2026 · Updated 3 weeks ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" ☆63 · Updated this week
- TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition ☆29 · Feb 5, 2026 · Updated last month
- ☆177 · Apr 23, 2025 · Updated 11 months ago
- EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models ☆80 · Dec 17, 2025 · Updated 3 months ago
- ☆127 · Updated this week
- Data Set Description Language Specification (DSDL, a next-generation AI dataset description language) ☆46 · May 29, 2024 · Updated last year
- Open-source WeChat Bot management platform | Self-hosted WeChat Bot Management & Message Relay | WebSocket + Webhook + AI Auto-reply | Passkey Login | 7 Language SDKs ☆158 · Updated this week
- LuaJIT raw-bytecode decompiler ☆10 · Nov 22, 2017 · Updated 8 years ago
- SDK of OpenDataLab - https://opendatalab.org.cn ☆59 · Jul 31, 2025 · Updated 7 months ago
- AI-powered threat hunting and incident response MCP server for Elasticsearch/OpenSearch ☆65 · Mar 18, 2026 · Updated last week
- Official implementation of "UniMedVL: Unifying Medical Multimodal Understanding and Generation through Observation-Knowledge-Analysis" - … ☆67 · Jan 15, 2026 · Updated 2 months ago
- The official pytorch implementation of Exploring the User Guidance for More Accurate Building Segmentation from High-Resolution Remote Se… ☆18 · May 27, 2024 · Updated last year
- Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning ☆20 · Dec 20, 2023 · Updated 2 years ago
- ☆86 · May 23, 2025 · Updated 10 months ago
- ☆39 · Feb 27, 2025 · Updated last year
- A practical guide to building AI products end-to-end. You'll learn how to choose models based on product constraints, decide when fine-t… ☆84 · Mar 11, 2026 · Updated 2 weeks ago
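- Reading order, Layoutreader ☆19 · May 8, 2025 · Updated 10 months ago
- A useRequest network request library, similar to ahooks, built on functional programming and a dio wrapper ☆81 · Mar 5, 2026 · Updated 3 weeks ago
- Let's Encrypt plugin for the plugin center of the koolshare merlin Xiaobao modified firmware ☆17 · Feb 3, 2020 · Updated 6 years ago
- Example code for getting started with Boost.Test. ☆39 · Apr 2, 2013 · Updated 12 years ago
- 开演AI integrates the entire film and video creation workflow into one powerful platform. From scriptwriting to video production, our AI tools help creators turn stories into reality faster. Core strengths: 🎯 One-stop solution - script, storyboard, image, and video completed in one place 🤖 AI-driven - intelligent parsing, generation, and optimization ⚡ Rapid creation - storyboards generated in seconds, say goodbye to… ☆100 · Mar 21, 2026 · Updated last week
- AAAI 2024: Visual Instruction Generation and Correction ☆96 · Feb 4, 2024 · Updated 2 years ago
- Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding ☆60 · Feb 10, 2026 · Updated last month
- [ACL 2024 Main Conference] Chinese commonsense benchmark for LLMs ☆44 · Jul 27, 2024 · Updated last year
- [ICCV 2025] SALAD -- Semantics-Aware Logical Anomaly Detection ☆43 · Oct 3, 2025 · Updated 5 months ago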
- ☆76 · Mar 14, 2026 · Updated last week
- ☆21 · Updated this week
- VisualGPTScore for visio-linguistic reasoning ☆27 · Oct 7, 2023 · Updated 2 years ago
- ☆107 · Feb 5, 2026 · Updated last month
- This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining. ☆47 · Aug 22, 2025 · Updated 7 months ago
- ☆79 · Jan 3, 2026 · Updated 2 months ago
- ☆101 · Feb 12, 2026 · Updated last month