s-nlp / parallel_detoxification_dataset
Data from the paper "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification"
☆14 Updated 3 weeks ago
Alternatives and similar repositories for parallel_detoxification_dataset:
Users who are interested in parallel_detoxification_dataset are comparing it to the repositories listed below
- RuSimpleSentEval (RSSE) shared task repo ☆21 Updated 3 years ago
- Russian Artificial Text Detection ☆17 Updated 2 years ago
- Probing suite for evaluation of Russian embedding and language models ☆33 Updated 6 months ago
- ☆12 Updated 2 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification ☆23 Updated last year
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora ☆20 Updated 3 weeks ago
- A small library with distillation, quantization and pruning pipelines ☆26 Updated 4 years ago
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events ☆29 Updated last year
- Russian Corpus of Linguistic Acceptability ☆43 Updated 6 months ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language ☆22 Updated 5 years ago
- "Rossiya Segodnya" news dataset ☆45 Updated 5 years ago
- Russian RoBERTa ☆29 Updated 5 years ago
- Course on deep learning in natural language processing for master's students in computational linguistics at the Higher School of Economics ☆47 Updated 2 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper ☆47 Updated 3 weeks ago
- NLP course @ CS Faculty, HSE ☆15 Updated 5 years ago
- http://nlp.seas.harvard.edu/2018/04/03/attention.html ☆62 Updated 3 years ago
- Datasets for the task of tracing diachronic semantic shifts in Russian for two large-scale time period pairs (from pre-Soviet to Soviet t… ☆14 Updated 2 months ago
- Code for "Planning and Generating Natural and Diverse Disfluent Texts as Augmentation for Disfluency Detection" ☆15 Updated 3 years ago
- Source code for paper Grammatical Error Correction in Low-Resource Scenarios (W-NUT 2019) ☆13 Updated 2 years ago
- Russian dialog datasets parsers and crawlers. ☆16 Updated 3 years ago
- BSNLP 2021 ☆33 Updated 5 months ago
- ☆23 Updated 4 years ago
- Distillation of BERT model with catalyst framework ☆77 Updated last year
- Models for automatically transforming toxic text to neutral ☆34 Updated last year
- Complementary code for our paper Automatic punctuation restoration with BERT models ☆49 Updated last year
- Speech analytics package for call-center ☆23 Updated 4 years ago
- A sentence paraphraser based on dependency parsing and word embedding similarity. ☆22 Updated 3 years ago
- Simple library to work with pre-trained ELMo models in TensorFlow ☆52 Updated last year
- RuREBus shared task repo ☆30 Updated 4 years ago
- ☆18 Updated 6 years ago