MarkusSagen / Master-Thesis-Multilingual-Longformer
Master's thesis with code investigating methods for incorporating long-context reasoning in low-resource languages without pre-training from scratch. We investigated whether multilingual models could inherit these properties by converting them into an Efficient Transformer (such as the Longformer architecture).
☆33 Updated 3 years ago
Alternatives and similar repositories for Master-Thesis-Multilingual-Longformer:
Users who are interested in Master-Thesis-Multilingual-Longformer are comparing it to the repositories listed below.
- ☆67 Updated 3 years ago
- Collection of NLP model explanations and accompanying analysis tools ☆145 Updated last year
- SUPERT: Unsupervised multi-document summarization evaluation & generation ☆94 Updated 2 years ago
- ☆92 Updated 3 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization. ☆34 Updated 2 years ago
- PyTorch implementation of Highly Parallel Autoregressive Entity Linking with Discriminative Correction ☆67 Updated 2 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch. ☆36 Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆74 Updated 2 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization. ☆140 Updated 2 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni… ☆79 Updated 2 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization" ☆66 Updated 3 years ago
- ☆74 Updated 3 years ago
- PyTorch implementation of the end-to-end coreference resolution model with different higher-order inference methods. ☆59 Updated last year
- Reimplementation of a BERT based model (Shi et al, 2019), currently the state-of-the-art for English SRL. This model implements also pred… ☆70 Updated 2 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 Updated 3 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021 ☆55 Updated 3 years ago
- ☆77 Updated 9 months ago
- Multilingual abstractive summarization dataset extracted from WikiHow. ☆85 Updated 3 years ago
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020) ☆56 Updated 2 years ago
- Data/Code Repository for https://api.semanticscholar.org/CorpusID:218470122 ☆130 Updated 6 months ago
- https://arxiv.org/pdf/1909.04054 ☆78 Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization ☆156 Updated 2 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT) ☆133 Updated last year
- ☆36 Updated last year
- Code from the paper "What do Models Learn from Question Answering Datasets?" (EMNLP 2020) ☆55 Updated 4 years ago
- ☆41 Updated 3 years ago
- ☆85 Updated 3 years ago
- ☆57 Updated 2 years ago
- PyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch … ☆80 Updated 2 years ago
- cRocoDiLe is a dataset extraction tool for Relation Extraction using Wikipedia and Wikidata presented in REBEL (EMNLP 2021). ☆66 Updated last year