MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert, MILES uses the bert-base-multilingual-uncased model, as well as simple language-agnostic approaches to complex word identification (CWI) and candidate ranking.
☆50May 3, 2021Updated 5 years ago
Alternatives and similar repositories for MILES
Users that are interested in MILES are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code to reproduce the experiments from the paper.☆104Oct 10, 2023Updated 2 years ago
- Neural CRF Model for Sentence Alignment in Text Simplification☆67Jan 19, 2025Updated last year
- A monolingual parallel corpus for sentence simplification☆11Jul 4, 2016Updated 9 years ago
- Controllable Sentence Simplification with T5☆18May 24, 2023Updated 3 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆60Sep 16, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Exploring Neural Text Simplification☆72Feb 14, 2018Updated 8 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- Codebase, data and models for the Keep it Simple paper at ACL2021☆39Jun 28, 2023Updated 3 years ago
- Lexical Simplification with Pretrained Encoders☆70Feb 5, 2021Updated 5 years ago
- Unsupervised Neural Text Simplification☆31Apr 14, 2021Updated 5 years ago
- ☆16May 22, 2023Updated 3 years ago
- Generate multiple choice fill-in-the-blank questions from any article.☆13Dec 8, 2022Updated 3 years ago
- Klexikon: A German Dataset for Joint Summarization and Simplification☆16Oct 5, 2022Updated 3 years ago
- ☆22Mar 31, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An implementation of data augmentation methods for natural language processing tasks.☆13Jul 25, 2024Updated last year
- This is a repo for DCQA QUD parsing implemenation☆12Aug 5, 2025Updated 10 months ago
- Interface for reading the Paraphrase Database (PPDB)☆24Mar 14, 2018Updated 8 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Feb 2, 2023Updated 3 years ago
- Easier Automatic Sentence Simplification Evaluation☆167Sep 25, 2023Updated 2 years ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention".☆15Jun 22, 2021Updated 5 years ago
- DRESS simplification model (EMNLP 2017) described in http://aclweb.org/anthology/D/D17/D17-1062.pdf☆154Nov 9, 2021Updated 4 years ago
- Extension of the SentenceSimplification project☆61Mar 31, 2025Updated last year
- ☆10Jul 27, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Alignment and annotation for comparable documents.☆22Oct 16, 2018Updated 7 years ago
- NLP command-line assistant powered by OpenAI☆21Jan 27, 2024Updated 2 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆19Feb 22, 2021Updated 5 years ago
- This repository contains materials for our tutorial on automatic grammatical error correction: R. Grundkiewicz, C. Bryant, M. Felice: A C…☆38Dec 12, 2020Updated 5 years ago
- ☆25May 11, 2024Updated 2 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Jul 23, 2020Updated 5 years ago
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆37Oct 6, 2021Updated 4 years ago
- Text Simplification System and Dataset☆124Jul 7, 2023Updated 2 years ago
- Automatic extraction of edited sentences from text edition histories.☆82Feb 14, 2022Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Mar 16, 2021Updated 5 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Nov 29, 2021Updated 4 years ago
- ☆12Jun 8, 2021Updated 5 years ago
- PorSimplesSent - A Portuguese corpus of aligned sentences pairs to investigate sentence readability assessment☆13Jan 15, 2020Updated 6 years ago
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆22Apr 13, 2023Updated 3 years ago
- GMEG☆32Nov 21, 2024Updated last year