s-nlp / parallel_detoxification_dataset
Data from "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification" paper
☆14Updated last week
Related projects ⓘ
Alternatives and complementary repositories for parallel_detoxification_dataset
- RuSimpleSentEval (RSSE) shared task repo☆21Updated 3 years ago
- Probing suite for evaluation of Russian embedding and language models☆32Updated last month
- ☆12Updated 2 years ago
- Russian RoBERTa☆29Updated 4 years ago
- A small library with distillation, quantization and pruning pipelines☆26Updated 3 years ago
- Distillation of BERT model with catalyst framework☆75Updated last year
- "Rossiya Segodnya" news dataset☆45Updated 5 years ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora☆20Updated 2 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆22Updated last year
- ☆18Updated 5 years ago
- Russian Drug Reaction Corpus (RuDReC)☆8Updated 3 years ago
- Russian Corpus of Linguistic Acceptability☆41Updated last month
- Russian Artificial Text Detection☆17Updated 2 years ago
- ☆36Updated last year
- BSNLP 2021☆32Updated last week
- Курс по глубокому обучению в обработке естественных языков для магистров компьютерной лингвистики Высшей Школы Экономики☆47Updated 2 years ago
- Pytorch library for end-to-end transformer models training, inference and serving☆70Updated 2 years ago
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc.☆52Updated last year
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆32Updated 3 years ago
- ☆29Updated last year
- 1st place solution for GramEval-2020☆14Updated last year
- NLP course @ CS Faculty, HSE☆15Updated 4 years ago
- Models for automatically transforming toxic text to neutral☆33Updated last year
- Russian coreference resolution made as simple and accessible as could be☆12Updated 2 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆46Updated 2 months ago
- Train punctuation and capitalization models for different languages☆24Updated 2 years ago
- A sentence paraphraser based on dependency parsing and word embedding similarity.☆22Updated 3 years ago
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events☆25Updated last year
- LM Pretraining with PyTorch/TPU☆132Updated 5 years ago
- Russian dialog datasets parsers and crawlers.☆16Updated 3 years ago