s-nlp / parallel_detoxification_dataset
Data from "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification" paper
☆14Updated 4 months ago
Alternatives and similar repositories for parallel_detoxification_dataset:
Users that are interested in parallel_detoxification_dataset are comparing it to the libraries listed below
- RuSimpleSentEval (RSSE) shared task repo☆22Updated 3 years ago
- ☆12Updated 2 years ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora☆20Updated last month
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc.☆54Updated last year
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events☆27Updated last year
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆23Updated last year
- Russian RoBERTa☆29Updated 5 years ago
- Russian Artificial Text Detection☆17Updated 2 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Updated 5 months ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 2 years ago
- "Rossiya Segodnya" news dataset☆45Updated 5 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆47Updated last month
- Models for automatically transforming toxic text to neutral☆34Updated last year
- NLP course @ CS Faculty, HSE☆15Updated 5 years ago
- Russian dialog datasets parsers and crawlers.☆16Updated 3 years ago
- RuREBus shared task repo☆30Updated 4 years ago
- This is an official repository for "Artificial Text Detection via Examining the Topology of Attention Maps" presented at EMNLP 2021 confe…☆22Updated last year
- Russian Corpus of Linguistic Acceptability☆42Updated 5 months ago
- A small library with distillation, quantization and pruning pipelines☆26Updated 3 years ago
- Code for BERT classifier finetuning for multiclass text classification☆71Updated 2 years ago
- Doing things with embeddings☆64Updated 2 years ago
- ☆42Updated 3 years ago
- ☆29Updated 2 years ago
- Word Embeddings for Low Resource Languages: The Case of Buryat☆10Updated last week
- Курс по глубокому обучению в обработке естественных языков для магистров компьютерной лингвистики Высшей Школы Экономики☆47Updated 2 years ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆48Updated last year
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Updated 8 months ago
- PyTorch implementation of 'An Unsupervised Neural Attention Model for Aspect Extraction' by He et al. ACL2017'☆66Updated 3 years ago
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Updated last year
- Pytorch library for end-to-end transformer models training, inference and serving☆70Updated 2 years ago