forms-data-structures / forms-data
Repo to host the forms dataset
☆15 · Updated 4 years ago
Alternatives and similar repositories for forms-data:
Users who are interested in forms-data are comparing it to the libraries listed below:
- Code for the ICDAR 2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing" ☆33 · Updated 3 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021 ☆78 · Updated 3 years ago
- DocILE: Document Information Localization and Extraction Benchmark ☆125 · Updated 11 months ago
- Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription. ☆37 · Updated 2 years ago
- Detect textlines in document images ☆92 · Updated 10 months ago
- Publicly released code for the LAMBERT model ☆103 · Updated 3 years ago
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation ☆120 · Updated last year
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer ☆58 · Updated 2 years ago
- CVPR 2022: Table Structure Recognition ☆39 · Updated 3 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library ☆103 · Updated 2 years ago
- OCR & Ground Truth Resources ☆75 · Updated 2 years ago
- CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images ☆133 · Updated 3 months ago
- Document Visual Question Answering ☆116 · Updated 4 years ago
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers ☆57 · Updated 7 months ago
- Key Information Extraction From Documents: Evaluation And Generator ☆20 · Updated 4 years ago
- Distorted Document Images dataset (DDI-100). ☆136 · Updated 2 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition ☆57 · Updated 7 months ago
- PyTorch implementation of our paper: Adapting OCR with Limited Labels ☆60 · Updated last year
- Quicksign OCRized Text Dataset (QS-OCR) ☆44 · Updated 5 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format. ☆23 · Updated 4 years ago
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images. In modern times, more a… ☆57 · Updated 2 years ago
- Research papers and code on information extraction from image/pdf ☆96 · Updated 2 years ago