danielvarab / massive-summ
☆27Updated last year
Related projects ⓘ
Alternatives and complementary repositories for massive-summ
- ☆25Updated 10 months ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆31Updated 2 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents (JCDL 2022)☆14Updated 2 years ago
- This dataset contains human judgements about answer equivalence. The data is based on SQuAD (Stanford Question Answering Dataset), and co…☆22Updated 2 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆79Updated 3 years ago
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data☆33Updated 2 years ago
- ☆58Updated 2 years ago
- ☆92Updated 3 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆81Updated 3 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆31Updated 3 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Updated 2 years ago
- ☆40Updated 3 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆30Updated last year
- ☆41Updated 3 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Updated 3 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆24Updated 3 years ago
- ☆67Updated 3 years ago
- ☆21Updated 6 months ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Updated 3 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆99Updated 3 years ago
- Pretraining scripts for BART transformer model☆11Updated last year
- ☆37Updated 3 years ago
- ☆70Updated 3 years ago
- Code for "Simulated Multiple Reference Training Improves Low-Resource Machine Translation"☆15Updated 3 years ago
- ☆73Updated 3 years ago
- ☆57Updated last year
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.☆19Updated 2 years ago
- ☆46Updated 4 years ago