baselines for DocVQA dataset
☆21Apr 11, 2021Updated 5 years ago
Alternatives and similar repositories for DocVQA
Users that are interested in DocVQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Document Visual Question Answering☆131Jul 30, 2020Updated 5 years ago
- ☆188May 8, 2024Updated 2 years ago
- JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do…☆14Nov 5, 2024Updated last year
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- https://www.nlp.ecei.tohoku.ac.jp/projects/aio/☆16Aug 4, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆52May 28, 2024Updated 2 years ago
- Unofficial implementation of the paper "MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition" by Bhunia et al. (2021).☆14Jun 22, 2022Updated 3 years ago
- ☆25Jun 25, 2021Updated 4 years ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆21Dec 4, 2024Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆30Dec 18, 2025Updated 5 months ago
- Investigation of focal and dice loss for the Kaggle 2018 data science bowl.☆18Mar 6, 2018Updated 8 years ago
- Publicly released code for the LAMBERT model☆106Jun 14, 2021Updated 5 years ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…☆53Jan 9, 2024Updated 2 years ago
- Detectron2 for Document Layout Analysis☆189Aug 2, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features☆25Nov 15, 2021Updated 4 years ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆366Oct 31, 2022Updated 3 years ago
- 飞桨模型加密库☆10Nov 13, 2021Updated 4 years ago
- ☆10Oct 1, 2020Updated 5 years ago
- ☆11Mar 24, 2023Updated 3 years ago
- Self-managed translation project interface☆15Updated this week
- Multi-span Style Extraction for Generative Reading Comprehension☆10Apr 2, 2021Updated 5 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆40Sep 15, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆18Mar 2, 2020Updated 6 years ago
- 国家统计局中国省市县乡村5级地址抓取,http://www.stats.gov.cn/tjsj/tjbz/tjyqhdmhcxhfdm/2018/index.html☆12Jan 8, 2020Updated 6 years ago
- ☆15May 26, 2021Updated 5 years ago
- ☆15Sep 16, 2021Updated 4 years ago
- DeVLBert: Learning Deconfounded Visio-Linguistic Representations☆27Nov 27, 2022Updated 3 years ago
- Predicting agriculture crop yields using climate modeling and deep learning.☆13Jun 2, 2017Updated 9 years ago
- A library for training crosscoders☆17May 28, 2025Updated last year
- ☆27Mar 6, 2026Updated 3 months ago
- Crop and resize texture in unity editor! Open it: Press F1☆16Sep 1, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- EAST: An Efficient and Accurate Scene Text Detector☆15Jan 22, 2018Updated 8 years ago
- ☆10Aug 14, 2019Updated 6 years ago
- Sparse Fourier Backpropagation in Cryo-EM Reconstruction☆12Dec 3, 2023Updated 2 years ago
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…☆19Jan 11, 2025Updated last year
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]☆57Apr 5, 2022Updated 4 years ago
- Create cohorts from databases utilizing the OMOP CDM☆10May 19, 2025Updated last year
- Neural Reranking for Named Entity Recognition, accepted as regular paper at RANLP 2017☆23Jul 15, 2017Updated 8 years ago