YongWookHa / im2latexLinks
Image to LaTeX pytorch model
☆14Updated last year
Alternatives and similar repositories for im2latex
Users that are interested in im2latex are comparing it to the libraries listed below
Sorting:
- A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding.☆39Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Python and JS tools to generate Printed LaTex formulas and images☆16Updated last year
- Another LaTex formula OCR tool☆15Updated 2 years ago
- This repo is used to release the ArxivFormula dataset.☆28Updated 6 months ago
- transformer based OCR framework used to train OCR or image to latex☆9Updated 2 years ago
- 阅读顺序、Layoutreader☆15Updated last month
- CTE: Contextualized Table Extraction Dataset☆17Updated 2 years ago
- Detect mathematical expressions in worksheets and draw bounding boxes.☆21Updated 4 years ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆35Updated last year
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Updated last year
- Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22☆13Updated last year
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆24Updated 4 years ago
- Handwritten mathematical symbols recognition with TrOCR☆18Updated last year
- Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集☆34Updated 2 years ago
- Pytorch implementation of math equation images to latex markup language.☆30Updated 4 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆105Updated 9 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 6 months ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- ☆9Updated 5 years ago
- GTDB dataset for training & evaluation for mathematical OCR systems☆28Updated 4 years ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆30Updated 2 years ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆36Updated last year
- ☆13Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Fully automated end-to-end framework to extract data from bar plots and other figures in scientific research papers using modules such as…☆112Updated 3 years ago
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆23Updated last week
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆14Updated last year
- Official repository accompaying the ICDAR 2023 paper☆12Updated last year
- Image to Latex using Encoder-Decoder architecture☆15Updated 2 weeks ago