Codebase for fine-tuning / evaluating nougat-based image2latex generation models
☆159Sep 25, 2024Updated last year
Alternatives and similar repositories for nougat-latex-ocr
Users who are interested in nougat-latex-ocr are comparing it to the libraries listed below
- A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixe…☆11Dec 11, 2023Updated 2 years ago
- A minimalist macOS app that converts a snapshot of an equation to LaTeX for free☆15Jun 14, 2024Updated last year
- TexTeller can convert images to LaTeX formulas (image2latex, LaTeX OCR) with higher accuracy and exhibits superior generalization ability,…☆715Aug 22, 2025Updated 6 months ago
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆380Nov 3, 2024Updated last year
- Official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆81Oct 14, 2023Updated 2 years ago
- A faster LayoutReader model based on LayoutLMv3 that sorts OCR bounding boxes into reading order.☆314Aug 15, 2025Updated 6 months ago
- Another LaTeX formula OCR tool☆15Feb 15, 2023Updated 3 years ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆458Sep 28, 2025Updated 5 months ago
- Datasets and Evaluation Scripts for CompHRDoc☆56Feb 25, 2025Updated last year
- Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition (ICDAR 2023)☆15Aug 29, 2023Updated 2 years ago
- WikiTableSet: The largest publicly available image-based table recognition dataset in three languages, built from Wikipedia☆32Jun 12, 2025Updated 8 months ago
- ☆142Feb 13, 2024Updated 2 years ago
- Large-scale training of a LaTeX formula recognition model, currently being organized for open-source release☆56Apr 17, 2024Updated last year
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,852Feb 21, 2025Updated last year
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆41Apr 7, 2025Updated 11 months ago
- Using OpenVINO to speed up inference of PaddleOCR-VL model☆25Mar 2, 2026Updated last week
- The official repository of "Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling"☆12Nov 26, 2025Updated 3 months ago
- Data repository for LaTeX OCR☆139Jun 11, 2024Updated last year
- Graph Key Information Extraction: GKIE☆11Sep 15, 2022Updated 3 years ago
- Wizard Bible archive☆12Apr 16, 2018Updated 7 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them…☆3,026Feb 7, 2026Updated last month
- This Logseq plugin is designed to transform LaTeX formula images from the clipboard into LaTeX code using Transformers.☆14Jul 10, 2024Updated last year
- 5th place on the final leaderboard of the Lightweight Text Recognition Technology Innovation Competition☆15Jul 15, 2021Updated 4 years ago
- Image to LaTeX pytorch model☆14Jul 6, 2023Updated 2 years ago
- DocTr++ in PaddlePaddle☆58Jul 24, 2024Updated last year
- STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023☆14Dec 2, 2024Updated last year
- ☆15Apr 26, 2024Updated last year
- Official implementation for ECCV 2022 paper "CoMER: Modeling Coverage for Transformer-based Handwritten Mathematical Expression Recogniti…☆131Sep 12, 2022Updated 3 years ago
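- Enhanced math formula recognition: handwritten and printed formulas in Chinese and English, with support for basic symbolic derivation (data structures based on the LaTeX abstract syntax tree). Math Formula OCR Pro, supports handwritten, Chinese-mixed formulas and simple symbol reaso…☆1,284Jun 11, 2024Updated last year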
- ☆102Dec 23, 2024Updated last year
- My solution for the "LLM - Detect AI Generated Text" Kaggle competition☆16Feb 2, 2024Updated 2 years ago
- An LLM chatbot for Minecraft servers.☆16Oct 14, 2025Updated 4 months ago
- ☆89Feb 9, 2025Updated last year
- ☆188Feb 27, 2024Updated 2 years ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆275Dec 6, 2025Updated 3 months ago
- Algorithms, papers, datasets, performance comparisons for Document AI.☆203Mar 1, 2025Updated last year
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Apr 3, 2024Updated last year
- Math OCR model that outputs LaTeX and markdown☆1,111Jan 29, 2025Updated last year