Simple python lib to tokenize texts into sentences and sentences to words. Small, fast and robust. Comes with ukrainian flavour
☆60Nov 1, 2023Updated 2 years ago
Alternatives and similar repositories for tokenize-uk
Users that are interested in tokenize-uk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a project to demonstrate NLP API from LanguageTool for Ukrainian language.☆77Feb 5, 2026Updated last month
- ☆19Feb 7, 2017Updated 9 years ago
- ☆30Nov 12, 2025Updated 4 months ago
- Digital lexicographic systems Ukrainian language + (the grammatical dictionary, synonymous dictionary, etymological dictionary +)☆65Dec 8, 2022Updated 3 years ago
- Stemmer for Ukrainian language in Python☆24Aug 10, 2017Updated 8 years ago
- Project to generate POS tag dictionary for Ukrainian language☆615Mar 17, 2026Updated last week
- UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language☆269Feb 11, 2024Updated 2 years ago
- Fun pet project for creating Ukrainian-speaking Conversational AI☆20May 4, 2023Updated 2 years ago
- ☆10Mar 4, 2016Updated 10 years ago
- A corpus of Ukrainian Twitter texts + instructions for downloading and filtering texts.☆15Jul 4, 2019Updated 6 years ago
- Add accents to words in the Ukrainian language☆15Oct 31, 2022Updated 3 years ago
- A novel stemmer for the Ukrainian language trained with AI☆29Nov 22, 2022Updated 3 years ago
- Materials from talks I gave☆21Nov 22, 2022Updated 3 years ago
- Flask/Mongo application to provide intuitive web-interface for tasks distribution☆36Feb 19, 2026Updated last month
- Code for Detecting language from text in python using fasttext☆13May 25, 2020Updated 5 years ago
- HR Analytics Dataset☆10Mar 29, 2019Updated 6 years ago
- A tool to extract plain text from HTML pages☆10Dec 7, 2017Updated 8 years ago
- Kyiv Smart City website.☆15Sep 18, 2015Updated 10 years ago
- 🇺🇦 Speech Recognition & Synthesis for Ukrainian☆426Sep 12, 2025Updated 6 months ago
- Match tokenized words and phrases within the original, untokenized, often messy, text.☆19Apr 11, 2023Updated 2 years ago
- Deep Reinforcement Learning Agent☆19Dec 9, 2015Updated 10 years ago
- Plugin for LiveStreet, JSON API interface☆19Jul 14, 2017Updated 8 years ago
- Docker file for ClickHouse☆14Jun 23, 2016Updated 9 years ago
- Transliteration for ukrainian language that uses officialy approved rules☆67Jul 19, 2024Updated last year
- Dictionary of word stresses in the Ukrainian language 🇺🇦☆22Sep 29, 2024Updated last year
- An elaborate approach for ABC-XYZ Analysis☆11May 10, 2020Updated 5 years ago
- ☆12Jul 30, 2024Updated last year
- This is a telegram bot for correcting language mistakes in group chats☆10Jun 29, 2021Updated 4 years ago
- ☆30Mar 13, 2026Updated last week
- ☆27Jun 12, 2023Updated 2 years ago
- LEFTJOIN.ru public repository☆24Dec 8, 2022Updated 3 years ago
- All presentations from Data Fest Kyiv 2017 http://datafest.in.ua☆13Apr 24, 2017Updated 8 years ago
- Scripts for "Deploy ML to production" workshop☆23Apr 25, 2018Updated 7 years ago
- A list of deep learning papers and notes on them☆11Dec 31, 2016Updated 9 years ago
- Our project to digitaze and open all declaration of ukrainian officials☆25Mar 6, 2023Updated 3 years ago
- C++ and Python Codes from my projects☆10Feb 29, 2020Updated 6 years ago
- DEREK (Domain Entities and Relations Extraction Kit)☆10May 22, 2023Updated 2 years ago
- Distributed sentiment analysis on GitHub commit comments☆10Jun 9, 2015Updated 10 years ago
- Use Python3, Django, Django-rest-framework to achieve alipay payment. 包括支付宝支付,支付宝服务器异步通知,支付宝退款☆12May 26, 2018Updated 7 years ago