breezedeus / Pix2Text
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
☆2,381Updated last week
Alternatives and similar repositories for Pix2Text:
Users who are interested in Pix2Text are comparing it to the libraries listed below.
- TexTeller can convert images to LaTeX formulas (image2latex, LaTeX OCR) with higher accuracy and exhibits superior generalization ability,…☆530Updated 2 weeks ago
- MixTeX: multimodal LaTeX, Chinese-English (ZhEn), and table OCR. It performs efficient CPU-based inference locally and offline on Windows.☆1,236Updated 2 weeks ago
- Convert images of LaTeX math equations into LaTeX code.☆2,115Updated 2 years ago
- pix2tex: Using a ViT to convert images of equations into LaTeX code.☆14,248Updated 3 months ago
- Math Formula OCR Pro (enhanced math formula recognition): handles handwritten and printed formulas in Chinese and English, and supports elementary symbolic derivation (data structure based on the LaTeX abstract syntax tree).☆1,211Updated 10 months ago
- Math OCR model that outputs LaTeX and markdown☆1,047Updated 3 months ago
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆349Updated 6 months ago
- Translate scientific papers in LaTeX, especially arXiv papers☆1,246Updated last month
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆152Updated 7 months ago
- Pix2Text MacOS App: A Mac desktop app for mathematical formula recognition and text recognition that runs locally.☆54Updated 10 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆314Updated last month
- Math Formula OCR☆528Updated 2 years ago
- LaTeX formula editor, produced by 妈叔☆1,105Updated last year
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆1,157Updated 3 weeks ago
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,430Updated 2 months ago
- CnSTD: A Python 3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis☆738Updated 2 months ago
- Using GPT to parse PDF☆3,396Updated 3 weeks ago
- Collects the best currently open-source table recognition models, refines their pre-processing and post-processing, and converts the models to ONNX.☆665Updated last month
- Markdown rendering + LaTeX extras (equations, tables, ...), with conversion features, for the scientific community☆605Updated last week
- Large-scale training of a LaTeX formula recognition model, currently being organized and open-sourced☆52Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,526Updated 2 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,703Updated 3 weeks ago
- End-to-end image-to-LaTeX formula recognition implemented in PyTorch, inspired by LinXueyuanStdio/LaTeX_OCR_PRO☆175Updated 5 years ago
- ☆489Updated 9 months ago
- 🤙 Easy replacement for LaTeX Beamer! 🥂 custom Marp templates with a selection of over a dozen themes☆713Updated 11 months ago
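- Cracks CAJViewer documents that carry an expiration date, including documents downloaded from the Science Library (科学文库) and the standards full-text database. The process is lossless: it preserves text and the table of contents and removes the expiration limit.☆253Updated last year
- A GUI implementation of MixTeX in Rust☆30Updated 2 months ago
- LatexFormatting is a utility used to format LaTeX and Markdown files.☆96Updated 3 months ago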
- An open-source academic paper management tool.☆1,731Updated 2 weeks ago
- Attachment Manager for Zotero☆929Updated last week