due-benchmark / du-schemaLinks
JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding domain.
☆13Updated 11 months ago
Alternatives and similar repositories for du-schema
Users that are interested in du-schema are comparing it to the libraries listed below
Sorting:
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."☆36Updated 2 years ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆56Updated 7 months ago
- Contrastive Fact Verification☆73Updated 3 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 4 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆56Updated 3 years ago
- ☆92Updated 4 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆68Updated 4 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆104Updated 4 years ago
- ☆36Updated 3 years ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization☆80Updated 4 years ago
- A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering☆46Updated 3 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Updated 2 years ago
- ☆58Updated 4 years ago
- ☆58Updated 4 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆80Updated 4 years ago
- ☆41Updated 4 years ago
- [NAACL 2022] Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning.☆57Updated last year
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxi…☆107Updated last year
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆76Updated 2 years ago
- A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP☆107Updated 3 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆155Updated 2 years ago
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically …☆187Updated 3 years ago
- Multi-hop dense retrieval for question answering☆217Updated 4 years ago
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆70Updated 2 years ago
- ☆68Updated 5 months ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 2 years ago
- Code for WikiAsp: Multi-document aspect-based summarization.☆42Updated 4 years ago
- Text Extraction Formulation + Feedback Loop for state-of-the-art WSD (EMNLP 2021)☆53Updated 3 years ago
- Dataset for Coherent Topic Segmentation and Classification☆36Updated 5 years ago