RossiyaSegodnya / ria_news_datasetLinks
"Rossiya Segodnya" news dataset
☆45Updated 5 years ago
Alternatives and similar repositories for ria_news_dataset
Users that are interested in ria_news_dataset are comparing it to the libraries listed below
Sorting:
- Probing suite for evaluation of Russian embedding and language models☆33Updated 8 months ago
- http://www.dialog-21.ru/evaluation/2016/letter/☆57Updated 8 years ago
- ☆29Updated 2 years ago
- RuREBus shared task repo☆30Updated 4 years ago
- Russian RoBERTa☆29Updated 5 years ago
- ☆36Updated 2 years ago
- ☆33Updated 7 years ago
- ☆83Updated 2 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Updated 6 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆67Updated last year
- DEREK (Domain Entities and Relations Extraction Kit)☆10Updated 2 years ago
- ☆55Updated 7 years ago
- nlp workshop at datafest siberia 2019☆22Updated 2 years ago
- Russian coreference resolution made as simple and accessible as could be☆12Updated 2 years ago
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc.☆54Updated 2 years ago
- A Russian data set for question answering over Wikidata☆47Updated 4 years ago
- ☆50Updated 7 years ago
- NLP course @ CS Faculty, HSE☆15Updated 5 years ago
- Natural language processing tools for English and Russian (postagging, syntax parsing, SRL, NER, language detection etc.)☆65Updated last week
- ☆23Updated 4 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆104Updated 4 years ago
- Russian FrameBank offline resources☆13Updated 5 years ago
- SpaCy official Russian model proposal☆31Updated 4 years ago
- MOdel ResOurCe COnsumption. Evaluate Russian SuperGLUE models performance: inference speed, RAM usage. Reproducible scores using Docker☆22Updated 2 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆49Updated 2 months ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆154Updated last year
- Word Embeddings for Low Resource Languages: The Case of Buryat☆10Updated 3 months ago
- AWD-LSTM language model trained on newspaper corpora with fast.ai☆27Updated 5 years ago
- Russian Corpus of Linguistic Acceptability☆44Updated 8 months ago
- Custom Russian tokenizer for spaCy☆43Updated 6 years ago