Kvasirs / MILES
MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert, MILES uses the bert-base-multilingual-uncased model, as well as simple language-agnostic approaches to complex word identification (CWI) and candidate ranking.
☆48Updated 3 years ago
Alternatives and similar repositories for MILES:
Users that are interested in MILES are comparing it to the libraries listed below
- ☆74Updated 3 years ago
- Code to reproduce the experiments from the paper.☆101Updated last year
- QED: A Framework and Dataset for Explanations in Question Answering☆115Updated 3 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆54Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆68Updated 3 years ago
- Code and Data for Evaluation WG☆41Updated 2 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 3 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- ☆20Updated 2 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- BERT models for many languages created from Wikipedia texts☆34Updated 4 years ago
- Use BERT to Fill in the Blanks☆82Updated 3 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 3 years ago
- Codebase, data and models for the Keep it Simple paper at ACL2021☆38Updated last year
- numeric fused-head identification and resolution☆33Updated 5 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆117Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Updated 3 years ago
- Repository for the ACL 2020 virtual conference website (work in progress)☆38Updated 2 years ago
- LongSumm - Scientific Document Summarization Task☆74Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆79Updated 6 months ago
- ☆64Updated last year
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings☆95Updated last year
- Dataset of ML and NLP papers☆35Updated 2 years ago
- ☆66Updated 4 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆83Updated 3 weeks ago
- Statistics on multilingual datasets☆17Updated 2 years ago