VikParuchuri / texify
Math OCR model that outputs LaTeX and markdown
☆1,103 Updated 11 months ago
Alternatives and similar repositories for texify
Users who are interested in texify are comparing it to the libraries listed below.
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models ☆159 Updated last year
- Formula recognition based on LaTeX-OCR and ONNXRuntime. ☆373 Updated last year
- TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability,… ☆692 Updated 4 months ago
- Extract structured text from pdfs quickly ☆643 Updated 6 months ago
- Markdown rendering + Latex extras (equations, tables, ...), with conversion features, for the scientific community ☆644 Updated this week
- Lightweight, performant, deep table extraction ☆518 Updated 4 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition ☆440 Updated 3 months ago
- UniTable: Towards a Unified Table Foundation Model ☆519 Updated last year
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception ☆1,897 Updated 8 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ☆2,811 Updated 10 months ago
- TF-ID: Table/Figure IDentifier for academic papers ☆245 Updated last year
- library supporting NLP and CV research on scientific papers ☆785 Updated last year
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ☆292 Updated 4 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task ☆272 Updated 3 weeks ago
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing ☆840 Updated last month
- Large scale training of Latex formula recognition model, currently being organized and open source ☆56 Updated last year
- Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ ☆1,668 Updated 4 months ago
- Pix2Text MacOS App: A Mac Desktop App for Mathematical Formula Recognition and Text Recognition. A local math formula and text recognition tool for Mac ☆74 Updated last year
- PyMuPDF4LLM ☆1,209 Updated 3 weeks ago
- Parse PDFs into markdown using Vision LLMs ☆455 Updated 3 months ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o… ☆2,822 Updated last year
- The code powering searchthearxiv.com, a simple semantic search engine for more than 300,000 ML papers on arXiv. ☆164 Updated 8 months ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆401 Updated 2 years ago
- 🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM. ☆598 Updated 6 months ago
- Improved file parsing for LLMs ☆3,146 Updated last year
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding ☆2,266 Updated 7 months ago
- Python bindings to PDFium, reasonably cross-platform. ☆699 Updated last week
- Effort to open-source NLLB checkpoints. ☆471 Updated last year
- Python PDF parser for scientific publications: content and figures ☆446 Updated last year
- LaTeXML: a TeX and LaTeX to XML/HTML/ePub/MathML translator. ☆1,188 Updated this week