breezedeus/Pix2Text

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/breezedeus/Pix2Text)

breezedeus / Pix2Text

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.

☆3,196

Alternatives and similar repositories for Pix2Text

Users that are interested in Pix2Text are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lukas-blecher / LaTeX-OCR
View on GitHub
pix2tex: Using a ViT to convert images of equations into LaTeX code.
☆16,500Jan 18, 2025Updated last year
LinXueyuanStdio / LaTeX_OCR_PRO
View on GitHub
数学公式识别增强版：中英文手写印刷公式、支持初级符号推导（数据结构基于 LaTeX 抽象语法树）Math Formula OCR Pro, supports handwrite, Chinese-mixed formulas and simple symbol reaso…
☆1,306Jun 11, 2024Updated 2 years ago
OleehyO / TexTeller
View on GitHub
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,…
☆752Aug 22, 2025Updated 11 months ago
RapidAI / RapidLaTeXOCR
View on GitHub
Formula recognition based on LaTeX-OCR and ONNXRuntime.
☆388Nov 3, 2024Updated last year
breezedeus / CnSTD
View on GitHub
CnSTD: 基于 PyTorch/MXNet 的中文/英文场景文字检测（Scene Text Detection）、数学公式检测（Mathematical Formula Detection, MFD）、篇章分析（Layout Analysis）的Python3 包
☆792Jul 5, 2026Updated 2 weeks ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
RQLuo / MixTeX-Latex-OCR
View on GitHub
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
☆1,634Apr 24, 2025Updated last year
facebookresearch / nougat
View on GitHub
Implementation of Nougat Neural Optical Understanding for Academic Documents
☆10,046Feb 21, 2025Updated last year
VikParuchuri / texify
View on GitHub
Math OCR model that outputs LaTeX and markdown
☆1,126Jan 29, 2025Updated last year
opendatalab / UniMERNet
View on GitHub
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
☆492Sep 28, 2025Updated 9 months ago
opendatalab / PDF-Extract-Kit
View on GitHub
A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,797Jan 3, 2025Updated last year
Ucas-HaoranWei / GOT-OCR2.0
View on GitHub
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,155Feb 10, 2025Updated last year
breezedeus / CnMFD_Dataset
View on GitHub
Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集
☆35Dec 21, 2022Updated 3 years ago
opendatalab / MinerU
View on GitHub
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
☆75,311Updated this week
Ucas-HaoranWei / Vary
View on GitHub
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
☆1,889Dec 30, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AlibabaResearch / AdvancedLiterateMachinery
View on GitHub
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…
☆1,833Mar 17, 2026Updated 4 months ago
opendatalab / DocLayout-YOLO
View on GitHub
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
☆2,233Apr 14, 2025Updated last year
InternScience / StructEqTable-Deploy
View on GitHub
A High-efficiency Open-source Toolkit for Table-to-Latex Task
☆276Dec 6, 2025Updated 7 months ago
LinXueyuanStdio / LaTeX_OCR
View on GitHub
数学公式识别 Math Formula OCR
☆554Mar 24, 2023Updated 3 years ago
datalab-to / marker
View on GitHub
Convert PDF to markdown + JSON quickly with high accuracy
☆37,711Updated this week
CosmosShadow / gptpdf
View on GitHub
Using GPT to parse PDF
☆3,559Apr 17, 2025Updated last year
kingyiusuen / image-to-latex
View on GitHub
Convert images of LaTex math equations into LaTex code.
☆2,160Oct 4, 2022Updated 3 years ago
datalab-to / surya
View on GitHub
OCR, layout analysis, reading order, table recognition in 90+ languages
☆21,130Updated this week
Mathpix / mathpix-markdown-it
View on GitHub
Markdown rendering + Latex extras (equations, tables, ...), with conversion features, for the scientific community
☆675Jul 6, 2026Updated 2 weeks ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Yuxiang1995 / ICDAR2021_MFD
View on GitHub
1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection（公式检测冠军方案）
☆134Sep 4, 2023Updated 2 years ago
360AILAB-NLP / 360LayoutAnalysis
View on GitHub
360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute
☆305Sep 10, 2024Updated last year
breezedeus / CnOCR
View on GitHub
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…
☆3,763Jul 5, 2026Updated 2 weeks ago
PDFMathTranslate / PDFMathTranslate
View on GitHub
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译，支持 Google/DeepL/Ollama/OpenAI 等服务，…
☆35,698May 25, 2026Updated last month
NormXU / nougat-latex-ocr
View on GitHub
Codebase for fine-tuning / evaluating nougat-based image2latex generation models
☆160Sep 25, 2024Updated last year
binary-husky / gpt_academic
View on GitHub
为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型…
☆71,115Jan 25, 2026Updated 5 months ago
qs956 / Latex_OCR_Pytorch
View on GitHub
基于Pytorch实现的End-to-End图像Latex公式识别 inspire by LinXueyuanStdio/LaTeX_OCR_PRO
☆178Apr 6, 2020Updated 6 years ago
PaddlePaddle / PaddleOCR
View on GitHub
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…
☆85,960Jul 15, 2026Updated last week
LBH1024 / CAN
View on GitHub
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition (ECCV’2022 Poster).
☆387Aug 5, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
X-PLUG / mPLUG-DocOwl
View on GitHub
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
☆2,408May 30, 2025Updated last year
hiroi-sora / Umi-OCR
View on GitHub
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。
☆46,176Nov 20, 2025Updated 8 months ago
LinXueyuanStdio / Data-for-LaTeX_OCR
View on GitHub
LaTeX OCR 的数据仓库
☆142Jun 11, 2024Updated 2 years ago
kaixindelele / ChatPaper
View on GitHub
Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
☆19,699Mar 2, 2026Updated 4 months ago
SUSYUSTC / MathTranslate
View on GitHub
translate scientific papers in latex, especially arxiv papers
☆1,362Sep 26, 2025Updated 9 months ago
felix-schmitt / FormulaNet
View on GitHub
FormulaNet is a new large-scale Mathematical Formula Detection dataset.
☆21Nov 21, 2022Updated 3 years ago
buptlihang / CDLA
View on GitHub
CDLA: A Chinese document layout analysis (CDLA) dataset
☆293Sep 13, 2021Updated 4 years ago