Tencent/POINTS-Reader

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Tencent/POINTS-Reader)

Tencent / POINTS-Reader

☆197

Alternatives and similar repositories for POINTS-Reader

Users that are interested in POINTS-Reader are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tencent / POINTS-GUI
View on GitHub
☆47Feb 9, 2026Updated 5 months ago
WePOINTS / WePOINTS
View on GitHub
☆190Mar 13, 2026Updated 4 months ago
guoxy25 / Ocean-OCR
View on GitHub
☆48Feb 7, 2025Updated last year
alibaba / Logics-Parsing
View on GitHub
☆1,396May 13, 2026Updated 2 months ago
felix-schmitt / MathNet
View on GitHub
MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition
☆10Mar 19, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bytedance / WildDoc
View on GitHub
The official repo for “WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?“
☆74May 19, 2025Updated last year
opendatalab / DocLayout-YOLO
View on GitHub
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,239Apr 14, 2025Updated last year
zzyhlyoko / DCTC
View on GitHub
☆42Sep 2, 2023Updated 2 years ago
VisualSphinx / VisualSphinx
View on GitHub
☆17Jun 3, 2025Updated last year
Yuliang-Liu / MultimodalOCR
View on GitHub
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
☆873Jul 22, 2026Updated last week
infly-ai / INF-MLLM
View on GitHub
INF Tech's open-source MLLMs for SOTA visual-language understanding and advanced document intelligence.
☆239Jul 22, 2026Updated last week
studio-dots-ai / dots.ocr
View on GitHub
Multilingual Document Layout Parsing in a Single Vision-Language Model
☆9,042Mar 24, 2026Updated 4 months ago
InternScience / StructEqTable-Deploy
View on GitHub
A High-efficiency Open-source Toolkit for Table-to-Latex Task
☆276Dec 6, 2025Updated 7 months ago
opendatalab / UniMERNet
View on GitHub
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
☆494Sep 28, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Veason-silverbullet / ViTLP
View on GitHub
[NAACL 2024] Visually Guided Generative Text-Layout Pre-training for Document Intelligence
☆149Sep 10, 2024Updated last year
Tencent-Hunyuan / HunyuanOCR
View on GitHub
HunyuanOCR-1.5: Making Lightweight OCR VLMs Faster and Better
☆1,895Updated this week
HCIILAB / M6Doc
View on GitHub
☆166May 8, 2025Updated last year
Yuliang-Liu / MonkeyOCR
View on GitHub
A lightweight LMM-based Document Parsing Model
☆6,616Jul 20, 2026Updated last week
bytedance / Dolphin
View on GitHub
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
☆9,041Mar 25, 2026Updated 4 months ago
OpenGVLab / Docopilot
View on GitHub
[CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding
☆37Jul 22, 2025Updated last year
TencentCloudADP / youtu-parsing
View on GitHub
Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding
☆69Jun 15, 2026Updated last month
Rapisurazurite / FFDN
View on GitHub
Implementation for Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition
☆30Feb 26, 2025Updated last year
opendatalab / mineru-vl-utils
View on GitHub
A Python package for interacting with the MinerU Vision-Language Model.
☆134Jun 11, 2026Updated last month
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
HumanMLLM / CoGenAV
View on GitHub
☆64Jul 1, 2025Updated last year
zhuzilin / vllm-group
View on GitHub
☆12Nov 5, 2024Updated last year
LayTextLLM / LayTextLLM
View on GitHub
☆103Dec 23, 2024Updated last year
irisXcoding / DocReal
View on GitHub
DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction
☆30Jun 28, 2023Updated 3 years ago
FreeOCR-AI / layoutreader
View on GitHub
A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.
☆323Aug 15, 2025Updated 11 months ago
Tencent-Hunyuan / Hunyuan-MT
View on GitHub
☆713Dec 30, 2025Updated 6 months ago
s-sahoo / Eso-LMs
View on GitHub
[ICML 2026] Esoteric Language Models
☆122Jul 13, 2026Updated 2 weeks ago
Ucas-HaoranWei / GOT-OCR2.0
View on GitHub
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,212Feb 10, 2025Updated last year
Hxyz-123 / ReasoningOCR
View on GitHub
☆18Jul 24, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
shannanyinxiang / UPOCR
View on GitHub
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
☆70Jun 6, 2024Updated 2 years ago
xenova / model-explorer
View on GitHub
Browse, search, and visualize ONNX models.
☆35May 6, 2025Updated last year
Token-family / TokenFD
View on GitHub
[ICCV2025] A Token-level Text Image Foundation Model for Document Understanding
☆135Aug 27, 2025Updated 11 months ago
xhli-git / DocSAM
View on GitHub
☆33Apr 8, 2025Updated last year
AlibabaResearch / AdvancedLiterateMachinery
View on GitHub
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…
☆1,832Mar 17, 2026Updated 4 months ago
zhangxiao339 / chineseocr-onnx
View on GitHub
chineres ocr from picture, 中英文本检测与文本识别，dense-ctc，dbnet，crnn，pse，unet等模型
☆11Sep 23, 2020Updated 5 years ago
krystalan / DRT
View on GitHub
Deep Reasoning Translation (DRT) Project
☆242Sep 1, 2025Updated 10 months ago