odaykhovskaya / obscene_words_ruView external linksLinks
Корпус ненормативной лексики русского языка для нужд NLP. Любые исправления и дополнения приветствуются
☆140Jan 15, 2020Updated 6 years ago
Alternatives and similar repositories for obscene_words_ru
Users that are interested in obscene_words_ru are comparing it to the libraries listed below
Sorting:
- My best solution for mlbootcamp4 competition☆11Jun 11, 2017Updated 8 years ago
- ☆51Nov 20, 2017Updated 8 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆157May 22, 2024Updated last year
- Открытые лингвистические датасеты: тональный словарь русского языка КартаСловСент, датасет по семантике, ассоциативный граф и датасет по …☆372Nov 24, 2021Updated 4 years ago
- ☆36Dec 8, 2022Updated 3 years ago
- nlp workshop at datafest siberia 2019☆22Dec 8, 2022Updated 3 years ago
- A list of pretrained Transformer models for the Russian language.☆177Feb 3, 2020Updated 6 years ago
- A Python implementation of LightFM, a hybrid recommendation algorithm.☆14Nov 3, 2017Updated 8 years ago
- ☆16May 19, 2016Updated 9 years ago
- Это репозиторий, куда я складываю материалы с разных своих докладов. Тут пока грязновато, но я приберусь.☆13Jun 4, 2019Updated 6 years ago
- ☆13Nov 12, 2025Updated 3 months ago
- Подборка ресурсов по машинному обучению☆1,447Jan 12, 2021Updated 5 years ago
- Corpus of Russian news articles collected from Lenta.Ru☆146Nov 19, 2022Updated 3 years ago
- Memes - why so popular?☆35Jan 30, 2019Updated 7 years ago
- Named entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке☆41Oct 10, 2025Updated 4 months ago
- Pre-trained models for tokenization, sentence segmentation and so on☆15Aug 22, 2017Updated 8 years ago
- All presentations from Data Fest Kyiv 2017 http://datafest.in.ua☆13Apr 24, 2017Updated 8 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆105May 13, 2021Updated 4 years ago
- Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ☆93Apr 4, 2017Updated 8 years ago
- Habrahabr API Client Library for Python☆37Jun 20, 2014Updated 11 years ago
- Materials for deep NLP course☆117Nov 11, 2018Updated 7 years ago
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- Links to Russian corpora + Python functions for loading and parsing☆309Feb 9, 2026Updated last week
- Set of common tools and techniques for everyday data science tasks with examples☆58Aug 2, 2019Updated 6 years ago
- An automatically annotated sentiment analysis dataset of product reviews in Russian.☆17Oct 25, 2020Updated 5 years ago
- Dockerized version of Google's SyntaxNet Parser and POS tagger for Russian + standalone server.☆16May 4, 2017Updated 8 years ago
- Russian FrameBank offline resources☆13Mar 27, 2020Updated 5 years ago
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Jul 25, 2024Updated last year
- Inspired by word2vec-pride-vis the replacement of words of Russian most valuable novels text with closest word2vec model words. By Boris …☆49Aug 1, 2024Updated last year
- ☆12Jun 5, 2016Updated 9 years ago
- Machine learning problems☆19Sep 14, 2025Updated 5 months ago
- ☆33Feb 14, 2019Updated 7 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- Solves basic Russian NLP tasks, API for lower level Natasha projects☆1,311Oct 17, 2024Updated last year
- Massively mapping management tool☆27Sep 6, 2021Updated 4 years ago
- http://www.dialog-21.ru/evaluation/2016/letter/☆57Dec 8, 2016Updated 9 years ago
- Analysis of Github Commits Comments☆36Feb 20, 2017Updated 8 years ago
- Репозиторий для лекций, семинаров и заданий по курсу "Анализ неструктурированных данных" ФКН ВШЭ☆34Dec 5, 2018Updated 7 years ago
- Russian morphological tagset converters library.☆42Oct 4, 2019Updated 6 years ago