svjack / docvqa-gen
Question Answering dataset generator of Document Visual in English and Chinese
☆23Updated last year
Related projects ⓘ
Alternatives and complementary repositories for docvqa-gen
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆49Updated 2 years ago
- Dataset and scripts for HRDoc☆33Updated last year
- ☆92Updated 4 years ago
- ☆21Updated 8 months ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆33Updated last year
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆68Updated last week
- ☆23Updated 3 years ago
- an unofficial code for augment-XY-CUT in XYLayoutLM☆25Updated 2 years ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆74Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆91Updated 2 months ago
- Document Visual Question Answering☆110Updated 4 years ago
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.☆99Updated last year
- CTE: Contextualized Table Extraction Dataset☆17Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- Object Detection Model for Scanned Documents☆83Updated last year
- ☆78Updated 2 years ago
- ☆30Updated 7 months ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆23Updated 3 years ago
- A curated list of papers about key information extraction.☆79Updated 3 months ago
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆74Updated last year
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆78Updated last year
- ☆50Updated 5 months ago
- ☆54Updated 10 months ago
- Datasets and Evaluation Scripts for CompHRDoc☆25Updated 7 months ago
- XFUND: A Multilingual Form Understanding Benchmark☆186Updated 2 years ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆104Updated 5 months ago
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆165Updated this week
- ☆127Updated 9 months ago
- Publicly released code for the LAMBERT model☆102Updated 3 years ago
- Table Structure Recognition☆62Updated last year