svjack / docvqa-gen
Question Answering dataset generator of Document Visual in English and Chinese
☆23Updated last year
Related projects ⓘ
Alternatives and complementary repositories for docvqa-gen
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆49Updated 2 years ago
- Dataset and scripts for HRDoc☆31Updated last year
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering"☆29Updated 3 weeks ago
- an unofficial code for augment-XY-CUT in XYLayoutLM☆25Updated 2 years ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆68Updated this week
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆33Updated last year
- ☆92Updated 4 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆91Updated 2 months ago
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.☆98Updated 11 months ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆35Updated last year
- XFUND: A Multilingual Form Understanding Benchmark☆185Updated 2 years ago
- ☆21Updated 7 months ago
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆74Updated last year
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆88Updated 5 months ago
- ☆50Updated 5 months ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆74Updated last year
- CTE: Contextualized Table Extraction Dataset☆17Updated last year
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆23Updated 3 years ago
- LLM+RAG for QA☆19Updated 9 months ago
- Publicly released code for the LAMBERT model☆102Updated 3 years ago
- A curated list of papers about key information extraction.☆78Updated 2 months ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆78Updated last year
- ☆77Updated 2 years ago
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition☆87Updated 5 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- Document Visual Question Answering☆110Updated 4 years ago
- Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer m…☆33Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding☆112Updated 10 months ago
- Key Information Extraction From Documents: Evaluation And Generator☆19Updated 3 years ago
- ☆16Updated last year