due-benchmark / du-schema
JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding domain.
☆14Updated this week
Related projects ⓘ
Alternatives and complementary repositories for du-schema
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."☆35Updated last year
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆51Updated 3 years ago
- ☆92Updated 3 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆54Updated 2 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆68Updated last year
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization☆63Updated 3 years ago
- Extracting six domain-specific QA datasets from MS MARCO☆17Updated 4 years ago
- ☆73Updated 3 years ago
- ☆36Updated 2 years ago
- ☆70Updated 3 years ago
- ☆16Updated last year
- Contrastive Fact Verification☆70Updated 2 years ago
- Publicly released code for the LAMBERT model☆102Updated 3 years ago
- ☆55Updated last year
- Authors' implementation of the paper Adaptive Information Seeking for Open-Domain Question Answering, published in EMNLP 2021.☆37Updated last year
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- ☆55Updated 3 years ago
- ☆37Updated 3 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆30Updated last year
- ☆60Updated last year
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 2 years ago
- ☆30Updated 2 years ago
- The Multitask Long Document Benchmark☆39Updated 2 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆90Updated last month
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆75Updated last year
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model.☆37Updated last year
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Updated 2 years ago
- ☆90Updated 7 months ago