VikParuchuri / texify
Math OCR model that outputs LaTeX and markdown
☆1,062 Updated 4 months ago
Alternatives and similar repositories for texify
Users who are interested in texify are comparing it to the libraries listed below.
- Formula recognition based on LaTeX-OCR and ONNXRuntime. ☆354 Updated 7 months ago
- Extract structured text from pdfs quickly ☆497 Updated 2 weeks ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition ☆354 Updated last week
- TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,… ☆557 Updated 2 months ago
- An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them… ☆2,473 Updated last month
- Lightweight, performant, deep table extraction ☆478 Updated this week
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models ☆153 Updated 9 months ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception ☆1,367 Updated 2 months ago
- UniTable: Towards a Unified Table Foundation Model ☆482 Updated last year
- TF-ID: Table/Figure IDentifier for academic papers ☆236 Updated 11 months ago
- Detect and extract tables to markdown and csv ☆749 Updated 5 months ago
- Markdown rendering + Latex extras (equations, tables, ...), with conversion features, for the scientific community ☆612 Updated 2 weeks ago
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing ☆757 Updated 4 months ago
- library supporting NLP and CV research on scientific papers ☆773 Updated 7 months ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation ☆526 Updated last month
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team… ☆1,739 Updated 2 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ☆252 Updated 2 weeks ago
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic… ☆619 Updated 3 weeks ago
- Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ ☆1,436 Updated this week
- Python PDF parser for scientific publications: content and figures ☆417 Updated last year
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis ☆115 Updated 3 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task ☆246 Updated 6 months ago
- This repository includes the official implementation of OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs. ☆695 Updated 2 months ago
- Large scale training of Latex formula recognition model, currently being organized and open source ☆53 Updated last year
- Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy. ☆870 Updated 8 months ago
- Improved file parsing for LLM’s ☆3,002 Updated 7 months ago
- 🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM. ☆563 Updated 2 weeks ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding ☆2,202 Updated 3 weeks ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine ☆467 Updated 5 months ago
- Python bindings to PDFium ☆586 Updated last week