nttmdlab-nlp/VisualMRC

VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)

☆57

Alternatives and similar repositories for VisualMRC

Users who are interested in VisualMRC are comparing it to the repositories listed below.

ronghanghu / vqa-maskrcnn-benchmark-m4c
Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…
☆13 · Jan 30, 2020 · Updated 6 years ago
xinke-wang / Awesome-Text-VQA
☆188 · May 8, 2024 · Updated 2 years ago
clovaai / webvicob
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
☆110 · Oct 24, 2023 · Updated 2 years ago
Form2Seq-Data / Dataset
Dataset corresponding to the paper: "Form2Seq: A Framework for Higher-Order Form Structure Extraction"
☆10 · Feb 17, 2021 · Updated 5 years ago
hppRC / defsent
DefSent: Sentence Embeddings using Definition Sentences
☆23 · Aug 5, 2021 · Updated 4 years ago
applicaai / kleister-nda
☆61 · Aug 18, 2021 · Updated 4 years ago
applicaai / kleister-charity
☆40 · Aug 18, 2021 · Updated 4 years ago
herobd / FUDGE
Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"
☆33 · Mar 4, 2022 · Updated 4 years ago
ronghanghu / mmf
A modular framework for Visual Question Answering research by the FAIR A-STAR team
☆45 · Aug 26, 2021 · Updated 4 years ago
microsoft / TAP
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)
☆72 · May 22, 2023 · Updated 3 years ago
researchmm / generate-it
A collection of models for image<->text generation in ACM MM 2021.
☆67 · Oct 31, 2021 · Updated 4 years ago
yashkant / sam-textvqa
Official code for the paper "Spatially Aware Multimodal Transformers for TextVQA", published at ECCV 2020.
☆65 · Sep 15, 2021 · Updated 4 years ago
AndresPMD / StacMR
Scene Text Aware Cross Modal Retrieval (StacMR)
☆24 · Sep 3, 2021 · Updated 4 years ago
syp2ysy / prompt-SelF
[TIP] Exploring Effective Factors for Improving Visual In-Context Learning
☆21 · Jul 2, 2025 · Updated last year
luciusssss / why-learn-shortcut
[ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?
☆16 · Aug 8, 2023 · Updated 2 years ago
machine-intelligence-laboratory / DDI-100
Distorted Document Images dataset (DDI-100).
☆147 · Nov 1, 2022 · Updated 3 years ago
naver-ai / tablevqabench
☆46 · May 21, 2024 · Updated 2 years ago
yahoojapan / YJCaptions
☆60 · Nov 29, 2016 · Updated 9 years ago
ananyahjha93 / libself
PyTorch Lightning-based framework to run experiments for self-supervised learning tasks.
☆10 · Feb 14, 2020 · Updated 6 years ago
DS3Lab / WordScape
The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.
☆42 · Dec 7, 2023 · Updated 2 years ago
chemicaltree / tetra
☆10 · Sep 14, 2022 · Updated 3 years ago
AkariAsai / unanswerable_qa
The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".
☆28 · Jun 19, 2021 · Updated 5 years ago
ymcui / expmrc
ExpMRC: Explainability Evaluation for Machine Reading Comprehension
☆62 · Aug 30, 2023 · Updated 2 years ago
nobu-g / JGLUE-evaluation-scripts
Training and evaluation scripts for JGLUE, a Japanese language understanding benchmark
☆18 · Updated this week
nttmdlab-nlp / SlideVQA
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
☆106 · Mar 31, 2025 · Updated last year
jasonwu0731 / GettingToKnowYou
☆21 · Nov 30, 2019 · Updated 6 years ago
bowong / Layered-Memory-Network
A Layered Memory Network for MovieQA
☆16 · Apr 27, 2018 · Updated 8 years ago
researchmm / soho
[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
☆208 · Sep 30, 2022 · Updated 3 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34 · May 14, 2020 · Updated 6 years ago
LukeForeverYoung / UReader
☆142 · Feb 13, 2024 · Updated 2 years ago
osekilab / JCoLA
☆19 · Apr 21, 2026 · Updated 3 months ago
due-benchmark / du-schema
JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do…
☆14 · Nov 5, 2024 · Updated last year
nullnull / normalizeNumexp
Normalizer of numerical and temporal expressions
☆11 · Sep 2, 2018 · Updated 7 years ago
manzoku23 / M1-Pytorch-Tutorial
PyTorch tutorial for M1 students. This repository includes code for building an Encoder-Decoder model and a classification model.
☆12 · Jun 1, 2022 · Updated 4 years ago
coastalcph / zeroshot_lexglue
Zero-shot evaluation on LEXGLUE tasks with GPT3.5
☆29 · Mar 11, 2023 · Updated 3 years ago
uvavision / SyViC
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13 · Sep 30, 2023 · Updated 2 years ago
izuna385 / Wikia-and-Wikipedia-EL-Dataset-Creator
You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wik…
☆18 · May 2, 2021 · Updated 5 years ago
Observeai-Research / Phoneme-BERT
☆34 · Jun 15, 2021 · Updated 5 years ago
ibm-aur-nlp / domain-specific-QA
Extracting six domain-specific QA datasets from MS MARCO
☆17 · Dec 1, 2019 · Updated 6 years ago