svjack / docvqa-gen
Document visual question answering dataset generator for English and Chinese
☆24 Updated 2 years ago
Alternatives and similar repositories for docvqa-gen
Users who are interested in docvqa-gen are comparing it to the repositories listed below
- Example codebase for fine-tuning LayoutLMv3 on DocVQA ☆52 Updated 3 years ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model" ☆74 Updated last month
- Dataset and scripts for HRDoc ☆40 Updated 2 years ago
- Object Detection Model for Scanned Documents ☆94 Updated 8 months ago
- ☆95 Updated 5 years ago
- XFUND: A Multilingual Form Understanding Benchmark ☆214 Updated 3 years ago
- An unofficial implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents ☆37 Updated 2 years ago
- An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP. ☆106 Updated 2 years ago
- ☆22 Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆129 Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction ☆30 Updated 2 years ago
- Table Structure Recognition ☆78 Updated 2 years ago
- ☆99 Updated 4 years ago
- Datasets and Evaluation Scripts for CompHRDoc ☆53 Updated 9 months ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions ☆43 Updated last year
- ICDAR 2024 Table OCR Model ☆38 Updated 4 months ago
- Document Visual Question Answering ☆127 Updated 5 years ago
- ☆45 Updated 3 years ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas… ☆80 Updated 2 years ago
- [Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document" ☆17 Updated last year
- Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer m… ☆41 Updated 2 years ago
- ☆32 Updated last year
- A curated list of papers about key information extraction. ☆102 Updated 11 months ago
- ☆51 Updated last year
- ☆99 Updated 11 months ago
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom … ☆45 Updated last year
- ☆22 Updated 4 years ago
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks. ☆133 Updated last month
- A Toolkit for Table-based Question Answering ☆115 Updated 2 years ago
- ☆40 Updated 5 years ago