VikParuchuri / texify
Math OCR model that outputs LaTeX and markdown
☆1,075 · Updated 7 months ago
Alternatives and similar repositories for texify
Users who are interested in texify are comparing it to the libraries listed below.
- Formula recognition based on LaTeX-OCR and ONNXRuntime. ☆364 · Updated 9 months ago
- TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,… ☆585 · Updated last week
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models ☆157 · Updated 11 months ago
- Extract structured text from pdfs quickly ☆585 · Updated 2 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition ☆396 · Updated 2 months ago
- Markdown rendering + Latex extras (equations, tables, ...), with conversion features, for the scientific community ☆624 · Updated last week
- Lightweight, performant, deep table extraction ☆503 · Updated 3 weeks ago
- Detect and extract tables to markdown and csv ☆753 · Updated 7 months ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception ☆1,564 · Updated 4 months ago
- UniTable: Towards a Unified Table Foundation Model ☆502 · Updated last year
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ☆2,742 · Updated 6 months ago
- TF-ID: Table/Figure IDentifier for academic papers ☆238 · Updated last year
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic… ☆670 · Updated this week
- library supporting NLP and CV research on scientific papers ☆780 · Updated 9 months ago
- Large scale training of Latex formula recognition model, currently being organized and open source ☆53 · Updated last year
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ☆271 · Updated 2 weeks ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task ☆258 · Updated 8 months ago
- Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ ☆1,514 · Updated 2 weeks ago
- Effort to open-source NLLB checkpoints. ☆457 · Updated last year
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing ☆787 · Updated 2 weeks ago
- LaTeXML: a TeX and LaTeX to XML/HTML/ePub/MathML translator. ☆1,124 · Updated last week
- Implementation of Nougat Neural Optical Understanding for Academic Documents ☆9,604 · Updated 6 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team… ☆1,763 · Updated 4 months ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation ☆752 · Updated last week
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆370 · Updated 2 years ago
- Python bindings to PDFium, reasonably cross-platform. ☆622 · Updated this week
- HTML to Markdown converter and crawler. ☆588 · Updated last year
- Python tool for generation of LaTeX code from PDF files. ☆91 · Updated last year
- LLMs as Copilots for Theorem Proving in Lean ☆1,154 · Updated 2 weeks ago
- YOLOv10 trained on DocLayNet dataset. ☆76 · Updated 9 months ago