breezedeus / CnMFD_Dataset
Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集
☆33Updated 2 years ago
Alternatives and similar repositories for CnMFD_Dataset:
Users that are interested in CnMFD_Dataset are comparing it to the libraries listed below
- ☆79Updated 2 years ago
- 1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection(公式检测冠军方案)☆130Updated last year
- Deep Splitting and Merging for Table Structure Decomposition☆69Updated last year
- Table Structure Recognition☆66Updated last year
- Re-implementation of MASTER by mmocr☆90Updated 3 years ago
- 中文版面检测(Chinese layout detection),yolov8 is used to detect the layout of Chinese document images。☆58Updated last year
- A large scale camera-taken table detection and recognition dataset.☆118Updated last year
- ☆115Updated last year
- ☆53Updated 7 months ago
- DocTr++ in PaddlePaddle☆43Updated 6 months ago
- ICDAR 2024 Table OCR Model☆28Updated 2 months ago
- CDLA: A Chinese document layout analysis (CDLA) dataset☆258Updated 3 years ago
- OCR Document image deformation correction.复现阿里OCR皱巴巴文档图像形变矫正☆93Updated 4 years ago
- ☆165Updated 11 months ago
- An implementation of the Splitting and Merging table recognition method.☆78Updated 5 years ago
- 中文论文、证券类、财报类PDF数据☆23Updated 8 months ago
- TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition☆96Updated 3 years ago
- TDF-ICDAR 2019 Dataset for Typeset Math Formula Detection☆67Updated 5 years ago
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆76Updated last year
- 通过浏览器渲染生成表格图像☆211Updated 10 months ago
- ☆77Updated last month
- ☆103Updated 4 years ago
- Implementation of research paper "Deep Splitting and Merging for Table Structure Decomposition"☆61Updated 2 years ago
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆28Updated 2 months ago
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.☆104Updated last year
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆27Updated last year
- chinese document classification of layoutlmv3 and layoutxlm☆42Updated 2 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆27Updated last year
- ☆41Updated last year
- [TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild☆31Updated last year