gmarus777 / Printed-Latex-Data-Generation
Python and JS tools to generate Printed LaTex formulas and images
☆16Updated last year
Alternatives and similar repositories for Printed-Latex-Data-Generation
Users that are interested in Printed-Latex-Data-Generation are comparing it to the libraries listed below
Sorting:
- Handwritten mathematical symbols recognition with TrOCR☆18Updated last year
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆17Updated 2 years ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆30Updated 2 years ago
- Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22☆13Updated last year
- CTE: Contextualized Table Extraction Dataset☆17Updated 2 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆16Updated 7 months ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆36Updated last year
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- Another LaTex formula OCR tool☆15Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆43Updated last year
- A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding.☆38Updated last year
- Image to Latex using Encoder-Decoder architecture☆13Updated last year
- ☆26Updated last year
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Updated last year
- 阅读顺序、Layoutreader☆12Updated last week
- ☆37Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆38Updated 2 months ago
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆52Updated 2 years ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆14Updated 5 months ago
- Implementation of the DocLLM paper for Llama models.☆13Updated last month
- Geometric Augmentation for Text Image☆9Updated 5 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Updated 3 years ago
- Tool to parse wiki tables from the HTML dump of Wikipedia☆11Updated 2 years ago
- ☆25Updated 7 years ago
- Dataset and scripts for HRDoc☆37Updated last year
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆28Updated last year
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared wi…☆45Updated 10 months ago
- Tools for content datamining and NLP at scale☆43Updated 10 months ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆35Updated last year