An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
☆3,020Feb 7, 2026Updated 3 weeks ago
Alternatives and similar repositories for Pix2Text
Users that are interested in Pix2Text are comparing it to the libraries listed below
- pix2tex: Using a ViT to convert images of equations into LaTeX code.☆16,196Jan 18, 2025Updated last year
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆380Nov 3, 2024Updated last year
- TexTeller can convert images to LaTeX formulas (image2latex, LaTeX OCR) with higher accuracy and exhibits superior generalization ability,…☆710Aug 22, 2025Updated 6 months ago
- Math Formula OCR Pro: an enhanced math formula recognizer for handwritten and printed formulas in mixed Chinese and English, with support for basic symbolic derivation (data structure based on the LaTeX abstract syntax tree).☆1,281Jun 11, 2024Updated last year
- CnSTD: A Python3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis.☆781Feb 7, 2026Updated 3 weeks ago
- MixTeX: multimodal LaTeX, Chinese-English (ZhEn), and table OCR. It performs efficient CPU-based inference locally and offline on Windows.☆1,605Apr 24, 2025Updated 10 months ago
- Math OCR model that outputs LaTeX and markdown☆1,111Jan 29, 2025Updated last year
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,848Feb 21, 2025Updated last year
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆458Sep 28, 2025Updated 5 months ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆9,402Jan 3, 2025Updated last year
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆55,275Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,089Feb 10, 2025Updated last year
- Convert PDF to markdown + JSON quickly with high accuracy☆32,069Updated this week
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,820Apr 9, 2025Updated 10 months ago
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,897Dec 30, 2024Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,360Updated this week
- Using GPT to parse PDF☆3,562Apr 17, 2025Updated 10 months ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,017Apr 14, 2025Updated 10 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆275Dec 6, 2025Updated 2 months ago
- Chinese Mathematical Formula Detection (MFD) Dataset for Chinese documents☆34Dec 21, 2022Updated 3 years ago
- Math Formula OCR☆548Mar 24, 2023Updated 2 years ago
- Convert images of LaTeX math equations into LaTeX code.☆2,150Oct 4, 2022Updated 3 years ago
- CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scen…☆3,730Feb 7, 2026Updated 3 weeks ago
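- Markdown rendering + LaTeX extras (equations, tables, ...), with conversion features, for the scientific community☆656Updated this week
- [EMNLP 2025 Demo] PDF scientific paper translation with preserved formats: AI-based full-text bilingual translation of PDF documents that fully preserves the layout, with support for services such as Google/DeepL/Ollama/OpenAI,…☆31,890Nov 25, 2025Updated 3 months ago
- Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other codebases, PDF/LaTeX paper translation and summarization, parallel queries to multiple LLMs, and local models such as chatglm3…☆70,122Jan 25, 2026Updated last month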
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆159Sep 25, 2024Updated last year
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,370May 30, 2025Updated 9 months ago
- translate scientific papers in latex, especially arxiv papers☆1,344Sep 26, 2025Updated 5 months ago
- OCR software, free, open-source, and offline. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and QR code scanning and generation. Ships with built-in multilingual libraries.☆42,306Nov 20, 2025Updated 3 months ago
- 📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO, MNN, PaddlePaddle and PyTorch.☆6,021Updated this week
- 360LayoutAnalysis, a series of Document Analysis Models and Datasets developed by 360 AI Research Institute☆306Sep 10, 2024Updated last year
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆71,369Updated this week
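- Use ChatGPT to summarize arXiv papers. Accelerates the full research workflow, using ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.☆19,257Nov 19, 2025Updated 3 months ago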
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆312Aug 15, 2025Updated 6 months ago
- 1st-place solution for the ICDAR 2021 Competition on Mathematical Formula Detection☆133Sep 4, 2023Updated 2 years ago
- CDLA: A Chinese document layout analysis (CDLA) dataset☆288Sep 13, 2021Updated 4 years ago
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆9,146Updated this week
- UniTable: Towards a Unified Table Foundation Model☆525Jun 4, 2024Updated last year