Simple python lib to tokenize texts into sentences and sentences to words. Small, fast and robust. Comes with ukrainian flavour
☆60Nov 1, 2023Updated 2 years ago
Alternatives and similar repositories for tokenize-uk
Users that are interested in tokenize-uk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a project to demonstrate NLP API from LanguageTool for Ukrainian language.☆77Feb 5, 2026Updated 2 months ago
- Ukrainian stopwords collection☆10Mar 5, 2020Updated 6 years ago
- Браунський корпус української мови☆118Mar 9, 2026Updated last month
- ☆30Nov 12, 2025Updated 5 months ago
- Набір різноманітних колекцій даних українською мовою зібраний протягом роботи над антикорупційними проектами. CSV–формат, до деяких датас…☆28Jan 21, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Home of Projector's "Data Science. Natural Language Processing" 2020 Edition☆19Oct 3, 2023Updated 2 years ago
- UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language☆270Feb 11, 2024Updated 2 years ago
- Fun pet project for creating Ukrainian-speaking Conversational AI☆20May 4, 2023Updated 2 years ago
- A corpus of Ukrainian Twitter texts + instructions for downloading and filtering texts.☆15Jul 4, 2019Updated 6 years ago
- A novel stemmer for the Ukrainian language trained with AI☆29Nov 22, 2022Updated 3 years ago
- Materials from talks I gave☆21Nov 22, 2022Updated 3 years ago
- Flask/Mongo application to provide intuitive web-interface for tasks distribution☆36Feb 19, 2026Updated last month
- Code for Detecting language from text in python using fasttext☆13May 25, 2020Updated 5 years ago
- A tool to extract plain text from HTML pages☆10Dec 7, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A fake presidential speech generator with a Mad Libs element.☆10Jul 19, 2017Updated 8 years ago
- Ukrainian dictionary in your terminal☆15Jul 6, 2024Updated last year
- Match tokenized words and phrases within the original, untokenized, often messy, text.☆19Apr 11, 2023Updated 3 years ago
- Deep Reinforcement Learning Agent☆19Dec 9, 2015Updated 10 years ago
- Ukrainian instruction-tuned language models and datasets☆96Jul 12, 2024Updated last year
- solutions to the end-chapter exercises in Paul Graham's "ANSI Common Lisp"☆11Feb 3, 2016Updated 10 years ago
- ☆13Feb 11, 2021Updated 5 years ago
- This is the implementation of paper "Learning to Ask Conversational Questions by Optimizing Levenshtein Distance".☆10Jul 5, 2021Updated 4 years ago
- Dictionary of word stresses in the Ukrainian language 🇺🇦☆22Sep 29, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Vosk ASR Docker images with GPU for Jetson boards, PCs, M1 laptops and GPC☆45May 16, 2022Updated 3 years ago
- ☆12Jul 30, 2024Updated last year
- This is a telegram bot for correcting language mistakes in group chats☆10Jun 29, 2021Updated 4 years ago
- ☆30Mar 25, 2026Updated 2 weeks ago
- ☆17Sep 2, 2017Updated 8 years ago
- Yii2-user demo site☆13Jan 5, 2017Updated 9 years ago
- A memory-based morphological parser for Python☆16Oct 12, 2012Updated 13 years ago
- Зведений список всіх існуючих інструментів, стандартів, реєстрів та решти за відкритими даними в Україні☆11May 18, 2019Updated 6 years ago
- All presentations from Data Fest Kyiv 2017 http://datafest.in.ua☆13Apr 24, 2017Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Skype extension for WakaTime☆12Feb 5, 2015Updated 11 years ago
- Scripts for "Deploy ML to production" workshop☆23Apr 25, 2018Updated 7 years ago
- Text language identification using Wikipedia data☆31Aug 15, 2017Updated 8 years ago
- C++ and Python Codes from my projects☆10Feb 29, 2020Updated 6 years ago
- DEREK (Domain Entities and Relations Extraction Kit)☆10May 22, 2023Updated 2 years ago
- Using NLP Topic Models to automate research paper topic classification.☆12Apr 14, 2021Updated 5 years ago
- Distributed sentiment analysis on GitHub commit comments☆10Jun 9, 2015Updated 10 years ago