Russian names parsers, gender identification and processing tools
☆137 · Dec 6, 2023 · Updated 2 years ago
Alternatives and similar repositories for russiannames
Users who are interested in russiannames compare it to the libraries listed below.
- "Rossiya Segodnya" news dataset ☆46 · Sep 25, 2019 · Updated 6 years ago
- Quick and dirty date parsing Python library to parse HTML dates really fast ☆21 · Jan 3, 2026 · Updated last month
- Extracts tables from .docx files and saves them as .csv or .xls files ☆65 · Oct 11, 2023 · Updated 2 years ago
- Memes - why so popular? ☆35 · Jan 30, 2019 · Updated 7 years ago
- Russian coreference resolution competition ☆11 · Mar 24, 2023 · Updated 2 years ago
- Building a registry of all domain names of the Russian Federation that belong to government authorities, state institutions, and regional … ☆55 · Oct 8, 2022 · Updated 3 years ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages ☆165 · Dec 8, 2025 · Updated 2 months ago
- Nationwide Russian reference directories from open sources ☆12 · May 23, 2019 · Updated 6 years ago
- Python library to read, write and convert data files with formats BSON, JSON, NDJSON, Parquet, ORC, XLS, XLSX, XML and many others ☆28 · Jan 28, 2026 · Updated last month
- ☆33 · Sep 20, 2017 · Updated 8 years ago
- Registry of data portals, catalogs, and data repositories, including a dataset of data catalogs and a catalog description standard ☆50 · Updated this week
- Russian data and parsers from database of registry of repression victims (http://lists.memo.ru/) ☆12 · Sep 1, 2021 · Updated 4 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets. ☆52 · Jul 4, 2018 · Updated 7 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems. ☆157 · May 22, 2024 · Updated last year
- System for automatic pronominal resolution for Russian ☆14 · Apr 3, 2020 · Updated 5 years ago
- ☆56 · May 12, 2018 · Updated 7 years ago
- Russian data from the SynTagRus corpus. ☆86 · Nov 12, 2025 · Updated 3 months ago
- Open data resources in Russian ☆223 · Dec 16, 2021 · Updated 4 years ago
- List of Russian personal names and surnames. ☆59 · Apr 1, 2016 · Updated 9 years ago
- undatum: a command-line tool for data processing. Brings CSV simplicity to NDJSON, BSON, XML and other data files ☆50 · Jan 19, 2026 · Updated last month
- Russian language models for spaCy ☆241 · Jul 14, 2021 · Updated 4 years ago
- Dataset collected from popular Russian collective blog Habrahabr.ru ☆13 · Oct 24, 2016 · Updated 9 years ago
- Corpus of Russian news articles collected from Lenta.Ru ☆146 · Nov 19, 2022 · Updated 3 years ago
- Lazy helper tool that makes scraping easier for simple tasks ☆19 · Oct 12, 2022 · Updated 3 years ago
- Open linguistic datasets: the КартаСловСент sentiment lexicon for Russian, a semantics dataset, an associative graph, and a dataset on … ☆371 · Nov 24, 2021 · Updated 4 years ago
- Sentiment analysis library for the Russian language ☆321 · Oct 30, 2023 · Updated 2 years ago
- Python library and cmd tool to backup API calls ☆18 · Nov 14, 2025 · Updated 3 months ago
- "Руформеры": a list of popular transformer-based foundation models for Russian natural language processing tasks ☆38 · Nov 21, 2023 · Updated 2 years ago
- ☆36 · Dec 8, 2022 · Updated 3 years ago
- Russian Law as Open Data ☆48 · Feb 5, 2026 · Updated 3 weeks ago
- A CLI tool that bundles source code files into a single context for LLM prompts ☆21 · Jan 9, 2025 · Updated last year
- Crawlers for the Taiga Corpus and Taiga Parser projects that download resources from open sources ☆14 · Apr 9, 2019 · Updated 6 years ago
- Russian coreference resolution made as simple and accessible as could be ☆12 · Sep 3, 2022 · Updated 3 years ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom… ☆46 · Jan 1, 2026 · Updated 2 months ago
- Full history for django models ☆19 · Apr 6, 2015 · Updated 10 years ago
- ☆13 · Feb 2, 2025 · Updated last year
- Collecting and analysing open data stuff ☆13 · May 27, 2021 · Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 4 years ago
- Rule-based facts extraction for Russian language ☆330 · Jul 24, 2023 · Updated 2 years ago