MarkusSagen / Master-Thesis-Multilingual-Longformer
Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre-train from scratch. We investigated if multilingual models could inherit these properties by making it an Efficient Transformer (s.a. the Longformer architecture).
☆33Updated 3 years ago
Alternatives and similar repositories for Master-Thesis-Multilingual-Longformer:
Users that are interested in Master-Thesis-Multilingual-Longformer are comparing it to the libraries listed below
- State of the art Semantic Sentence Embeddings☆99Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- ☆86Updated 3 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆132Updated last year
- ☆92Updated 3 years ago
- https://arxiv.org/pdf/1909.04054☆78Updated 2 years ago
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".☆50Updated 3 years ago
- ☆68Updated 3 years ago
- Pytorch implementation of Highly Parallel Autoregressive Entity Linking with Discriminative Correction☆67Updated 2 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 3 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- ☆57Updated 2 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆88Updated last week
- Code for the CRAC 2021 paper "On Generalization in Coreference Resolution" (Best short paper award)☆35Updated last year
- ☆36Updated last year
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆80Updated 2 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆94Updated 2 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆36Updated last year
- Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data☆100Updated last year
- BERTserini☆25Updated 2 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆66Updated 3 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆152Updated 2 years ago
- ☆75Updated 3 years ago
- A machine learning-based system that uses state-of-the-art natural language processing (NLP) question answering (QA) techniques combined …☆26Updated last year
- EMNLP 2020 - Summarizing Text on Any Aspects☆37Updated 4 years ago
- Research framework for low resource text classification that allows the user to experiment with classification models and active learning…☆101Updated 3 years ago
- ☆41Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆74Updated 3 years ago
- ☆47Updated 2 years ago