felix-schmitt / FormulaNet
FormulaNet is a new large-scale Mathematical Formula Detection dataset.
☆14Updated last year
Related projects: ⓘ
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆121Updated 10 months ago
- ☆103Updated 7 months ago
- This repo is used to release the ArxivFormula dataset.☆20Updated 6 months ago
- ☆74Updated 2 years ago
- A large scale camera-taken table detection and recognition dataset.☆107Updated 11 months ago
- ☆21Updated 11 months ago
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition☆82Updated 3 months ago
- https://dl.acm.org/doi/10.1145/3657281☆85Updated 4 months ago
- ☆81Updated last year
- The official implementation of SPTS v2: Single-Point Text Spotting☆123Updated last year
- Official implementation for ICDAR 2021 best poster paper "Handwritten Mathematical Expression Recognition with Bidirectionally Trained Tr…☆121Updated 7 months ago
- Evaluation Tool for the ICDAR 2019 Competition on Table Detection and Recognition☆39Updated 2 years ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆144Updated 3 weeks ago
- DocILE: Document Information Localization and Extraction Benchmark☆116Updated 4 months ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆34Updated last year
- Table Structure Recognition☆52Updated last year
- Datasets and Evaluation Scripts for CompHRDoc☆19Updated 5 months ago
- The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.☆71Updated 2 months ago
- Synthesize distorted document image and control points.☆39Updated 2 years ago
- This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …☆161Updated 3 years ago
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆52Updated last week
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆25Updated last year
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆24Updated 5 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- ☆40Updated 2 years ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆116Updated 10 months ago
- Inference, training and evaluation code for our models from the paper "Inv3D: a high-resolution 3D invoice dataset for template-guided si…☆39Updated 7 months ago
- The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)☆115Updated last month
- ☆37Updated 2 months ago
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆48Updated 2 years ago