Document Visual Question Answering dataset generator in English and Chinese
☆24Apr 17, 2023Updated 2 years ago
Alternatives and similar repositories for docvqa-gen
Users who are interested in docvqa-gen are comparing it to the libraries listed below.
- Dataset and scripts for HRDoc☆41Jun 21, 2023Updated 2 years ago
- MXNet implementation of SEC☆21Aug 15, 2018Updated 7 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- Code for ICCV 2023 Paper: “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Aug 8, 2023Updated 2 years ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆105Mar 31, 2025Updated 11 months ago
- BH hackathon☆14Apr 4, 2024Updated last year
- Source-code reading notes for simpledet and mmdetection☆27May 21, 2019Updated 6 years ago
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering☆10Nov 27, 2022Updated 3 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- ☆18Jul 7, 2025Updated 8 months ago
- finetune the resnet model☆12Sep 4, 2018Updated 7 years ago
- ☆13Feb 24, 2021Updated 5 years ago
- Document Visual Question Answering☆130Jul 30, 2020Updated 5 years ago
- ☆13Jun 16, 2021Updated 4 years ago
- GPGPU on Android☆13Feb 16, 2023Updated 3 years ago
- Optimized inference with Ascend and Hugging Face☆12Apr 23, 2024Updated last year
- Crafting Adversarial Examples for Neural Machine Translation☆10Apr 7, 2023Updated 2 years ago
- Commonly used Android utility classes and custom Views, collected from day-to-day work☆15Jun 6, 2021Updated 4 years ago
- API to load and query documents using RAG☆14Sep 25, 2023Updated 2 years ago
- ☆10May 25, 2022Updated 3 years ago
- An LLM based shell assistant that knows your usual shell commands.☆17Jul 18, 2025Updated 8 months ago
- Tool to parse wiki tables from the HTML dump of Wikipedia☆11Jun 12, 2022Updated 3 years ago
- Repository of the ICNLSP 2024 paper "Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes…☆17Jan 7, 2025Updated last year
- Face Depth map Generation using PRNet☆10Dec 21, 2020Updated 5 years ago
- Detect the format of one or more identically-formatted date strings☆13Apr 26, 2018Updated 7 years ago
- ☆24Nov 21, 2023Updated 2 years ago
- This robot processes randomly generated PDF invoices with Amazon Textract and saves the extracted invoice data in an Excel file.☆14Jan 27, 2023Updated 3 years ago
- This is the repository for the Master of Science thesis titled "GAN-based Matrix Factorization for Recommender Systems".☆10Aug 10, 2020Updated 5 years ago
- Pure MicroPython Bosch BME680 sensor driver☆18Feb 27, 2024Updated 2 years ago
- ☆58Oct 23, 2024Updated last year
- NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)☆30Jul 18, 2023Updated 2 years ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- Fine-tune FLUX 1.dev for personal AI photos☆22Sep 4, 2024Updated last year
- 🎵 High-quality lossless music playback and download 🎵☆21Mar 2, 2020Updated 6 years ago
- ICLR Reproducibility Challenge: Generative Adversarial Models For Learning Private And Fair Representations☆12Jan 12, 2019Updated 7 years ago
- Can VLMs understand students' hand-drawn math work?☆17Jan 20, 2026Updated 2 months ago
- DigitallyReconstructedRadiograph implementation in Python.☆10Jan 7, 2021Updated 5 years ago
- Inference, training and evaluation code for our models from the paper "Inv3D: a high-resolution 3D invoice dataset for template-guided si…☆59Feb 7, 2024Updated 2 years ago
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"☆74Feb 6, 2026Updated last month