MarkusSagen / Master-Thesis-Multilingual-LongformerLinks
Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre-train from scratch. We investigated if multilingual models could inherit these properties by making it an Efficient Transformer (s.a. the Longformer architecture).
☆33Updated 3 years ago
Alternatives and similar repositories for Master-Thesis-Multilingual-Longformer
Users that are interested in Master-Thesis-Multilingual-Longformer are comparing it to the libraries listed below
Sorting:
- Multilingual abstractive summarization dataset extracted from WikiHow.☆92Updated 3 months ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆94Updated 2 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Updated last year
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆78Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- Pytorch implementation of Highly Parallel Autoregressive Entity Linking with Discriminative Correction☆67Updated 3 years ago
- ☆68Updated last month
- LongSumm - Scientific Document Summarization Task☆74Updated 2 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- https://arxiv.org/pdf/1909.04054☆79Updated 2 years ago
- ☆87Updated 3 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- ☆47Updated 2 years ago
- ☆92Updated 3 years ago
- ☆36Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- ☆59Updated 2 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆154Updated 2 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 3 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆132Updated last year
- A long version of BART model based on Longformer model☆23Updated 2 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆144Updated 2 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 2 years ago
- PyTorch implementation of the end-to-end coreference resolution model with different higher-order inference methods.☆60Updated 2 years ago
- CrossNER: Evaluating Cross-Domain Named Entity Recognition (AAAI-2021)☆130Updated 4 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆206Updated last year
- Named Entity Recognition as Dependency Parsing☆39Updated 4 years ago
- ☆76Updated 3 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆67Updated 3 years ago