s-nlp / parallel_detoxification_dataset
Data from "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification" paper
☆14Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for parallel_detoxification_dataset
- RuSimpleSentEval (RSSE) shared task repo☆21Updated 3 years ago
- Russian Artificial Text Detection☆17Updated 2 years ago
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events☆25Updated last year
- Probing suite for evaluation of Russian embedding and language models☆32Updated last month
- A small library with distillation, quantization and pruning pipelines☆26Updated 3 years ago
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc.☆52Updated last year
- BSNLP 2021☆32Updated 2 weeks ago
- Pytorch library for end-to-end transformer models training, inference and serving☆70Updated 2 years ago
- Russian RoBERTa☆29Updated 4 years ago
- Course on deep learning in natural language processing for master's students in computational linguistics at the Higher School of Economics☆47Updated 2 years ago
- Russian Corpus of Linguistic Acceptability☆41Updated last month
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆22Updated last year
- NLP course @ CS Faculty, HSE☆15Updated 4 years ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora☆20Updated 2 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆46Updated 2 months ago
- Complementary code for our paper "Automatic punctuation restoration with BERT models"☆48Updated last year
- Distillation of BERT model with catalyst framework☆75Updated last year
- RuREBus shared task repo☆30Updated 3 years ago
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.☆48Updated 3 years ago
- Gazeta: Dataset for automatic summarization of Russian news☆32Updated 3 years ago
- MOdel ResOurCe COnsumption. Evaluate the performance of Russian SuperGLUE models: inference speed, RAM usage. Reproducible scores using Docker☆21Updated 2 years ago
- Russian coreference resolution made as simple and accessible as could be☆12Updated 2 years ago
- "Rossiya Segodnya" news dataset☆45Updated 5 years ago
- Russian dialog datasets parsers and crawlers.☆16Updated 3 years ago
- Unofficial implementation of QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition.☆65Updated 2 years ago
- Datasets for the task of tracing diachronic semantic shifts in Russian for two large-scale time period pairs (from pre-Soviet to Soviet t…☆14Updated 6 months ago
- Code for BERT classifier finetuning for multiclass text classification☆70Updated 2 years ago
- Word Embeddings for Low Resource Languages: The Case of Buryat☆10Updated last year