lang-uk / ua-gazetteersLinks
Набір різноманітних колекцій даних українською мовою зібраний протягом роботи над антикорупційними проектами. CSV–формат, до деяких датасетів також наявний переклад англійською або російською.
☆28Updated 3 years ago
Alternatives and similar repositories for ua-gazetteers
Users that are interested in ua-gazetteers are comparing it to the libraries listed below
Sorting:
- Ukranian NER annotation project☆92Updated 3 months ago
- Браунський корпус української мови☆116Updated 2 months ago
- Scripts for updating pymorphy2 dictionaries☆37Updated last year
- Ukrainian tone dictionary☆48Updated 8 years ago
- Попытка сделать свой GLR-парсер для русского языка на Python☆141Updated 8 years ago
- ☆27Updated 2 months ago
- A web-based engine for creating and annotating textual corpora☆247Updated last year
- Flask/Mongo application to provide intuitive web-interface for tasks distribution☆35Updated 2 weeks ago
- Python interface to http://opencorpora.org/☆45Updated 4 years ago
- Our project to digitaze and open all declaration of ukrainian officials☆25Updated 2 years ago
- ☆20Updated 8 years ago
- Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ☆89Updated 8 years ago
- Parser and analyzer of Russian in Python 3☆96Updated 12 years ago
- Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libriaries, etc.)☆207Updated 3 weeks ago
- Simple python lib to tokenize texts into sentences and sentences to words. Small, fast and robust. Comes with ukrainian flavour☆61Updated last year
- Site and documents of the lang-uk group☆40Updated 9 years ago
- Term extraction for Russian language☆89Updated 6 years ago
- Project to generate POS tag dictionary for Ukrainian language☆589Updated 3 weeks ago
- ☆50Updated 7 years ago
- UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language☆262Updated last year
- Корпус ненормативной лексики русского языка для нужд NLP. Любые исправления и дополнения приветствуются☆137Updated 5 years ago
- Seman is a set of linguistic tools to analyze Russian or German texts, it contains lexicons and grammars. The project is interesting as a…☆88Updated 5 months ago
- Russian morphological tagset converters library.☆42Updated 5 years ago
- Ukrainian instruction-tuned language models and datasets☆96Updated last year
- Открытые лингвистические датасеты: тональный словарь русского языка КартаСловСент, датасет по семантике, ассоциативный граф и датасет по …☆369Updated 3 years ago
- ☆40Updated 6 years ago
- Comparing quality and performance of NLP systems for Russian language☆49Updated 2 years ago
- Inspired by word2vec-pride-vis the replacement of words of Russian most valuable novels text with closest word2vec model words. By Boris …☆49Updated last year
- Transliteration for ukrainian language that uses officialy approved rules☆66Updated last year
- Creating Russian voice model for cmu-sphinx☆90Updated 9 years ago