felix-schmitt / FormulaNetLinks
FormulaNet is a new large-scale Mathematical Formula Detection dataset.
☆19Updated 3 years ago
Alternatives and similar repositories for FormulaNet
Users that are interested in FormulaNet are comparing it to the libraries listed below
Sorting:
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆152Updated 2 months ago
- This repo is used to release the ArxivFormula dataset.☆33Updated last year
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆202Updated 8 months ago
- https://dl.acm.org/doi/10.1145/3657281☆97Updated last year
- ☆156Updated 6 months ago
- Official implementation for ICDAR 2021 best poster paper "Handwritten Mathematical Expression Recognition with Bidirectionally Trained Tr…☆126Updated last year
- The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…☆300Updated 11 months ago
- The official implementation of SPTS v2: Single-Point Text Spotting☆138Updated 2 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆75Updated last year
- Table Structure Recognition☆78Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆113Updated last year
- ☆62Updated last year
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.☆106Updated 2 years ago
- A curated list of papers about key information extraction.☆102Updated 11 months ago
- ☆19Updated 3 years ago
- This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …☆182Updated 4 years ago
- ☆100Updated last year
- A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition☆191Updated last month
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Updated 2 years ago
- A collection of OCR-related datasets☆197Updated 3 years ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆287Updated 2 years ago
- ☆67Updated last year
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition☆104Updated last year
- A curated list of resources dedicated to table recognition☆404Updated 11 months ago
- ☆466Updated 4 months ago
- Distorted Document Images dataset (DDI-100).☆142Updated 3 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆356Updated 3 years ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆149Updated 6 months ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆20Updated 11 months ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆125Updated 2 years ago