Kvasirs / MILES
MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert, MILES uses the bert-base-multilingual-uncased model, as well as simple language-agnostic approaches to complex word identification (CWI) and candidate ranking.
☆48Updated 3 years ago
Alternatives and similar repositories for MILES:
Users that are interested in MILES are comparing it to the libraries listed below
- Code to reproduce the experiments from the paper.☆101Updated last year
- ☆74Updated 3 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆68Updated 3 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 3 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆115Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- Dataset of ML and NLP papers☆35Updated 2 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆29Updated last week
- A embed able annotation tool for end to end cross document co-reference☆41Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆79Updated 7 months ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 3 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Updated 6 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 2 months ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- Easier Automatic Sentence Simplification Evaluation☆160Updated last year
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 4 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- A benchmark for code-switched NLP, ACL 2020☆74Updated 8 months ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆54Updated 2 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆85Updated 3 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆63Updated 4 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- ☆12Updated 3 years ago
- Codebase, data and models for the Keep it Simple paper at ACL2021☆38Updated last year
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago
- Use BERT to Fill in the Blanks☆82Updated 3 years ago