s-nlp / parallel_detoxification_dataset
Data from the paper "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification"
☆14 · Updated 3 months ago
Alternatives and similar repositories for parallel_detoxification_dataset:
Users who are interested in parallel_detoxification_dataset are comparing it to the repositories listed below
- RuSimpleSentEval (RSSE) shared task repo ☆22 · Updated 3 years ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora ☆20 · Updated last week
- Models for automatically transforming toxic text to neutral ☆34 · Updated last year
- Russian Corpus of Linguistic Acceptability ☆42 · Updated 4 months ago
- Russian Artificial Text Detection ☆17 · Updated 2 years ago
- A small library with distillation, quantization and pruning pipelines ☆26 · Updated 3 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper ☆46 · Updated last week
- A course on deep learning in natural language processing for computational linguistics master's students at the Higher School of Economics ☆47 · Updated 2 years ago
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc. ☆54 · Updated last year
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events ☆27 · Updated last year
- Probing suite for evaluation of Russian embedding and language models ☆33 · Updated 4 months ago
- ☆12 · Updated 2 years ago
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm. ☆49 · Updated 4 years ago
- Dual Encoders for State-of-the-art Natural Language Processing. ☆61 · Updated 2 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification ☆22 · Updated last year
- Russian RoBERTa ☆29 · Updated 5 years ago
- A corpus of Ukrainian Twitter texts + instructions for downloading and filtering texts. ☆15 · Updated 5 years ago
- Train punctuation and capitalization models for different languages ☆24 · Updated 2 years ago
- "Rossiya Segodnya" news dataset ☆45 · Updated 5 years ago
- Distillation of BERT model with catalyst framework ☆76 · Updated last year
- NLP course @ CS Faculty, HSE ☆15 · Updated 4 years ago
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES ☆15 · Updated 4 years ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language ☆19 · Updated 5 years ago
- RuREBus shared task repo ☆30 · Updated 4 years ago
- ☆12 · Updated 3 months ago
- GeDi: Generative Discriminator Guided Sequence Generation ☆208 · Updated 2 years ago
- Code for CAET5 ☆23 · Updated last year
- code associated with ACL 2021 DExperts paper ☆113 · Updated last year
- A sentence paraphraser based on dependency parsing and word embedding similarity. ☆22 · Updated 3 years ago
- Russian coreference resolution made as simple and accessible as could be ☆12 · Updated 2 years ago