Russian text segmenter and tokenizer
☆18Mar 2, 2021Updated 5 years ago
Alternatives and similar repositories for rutokenizer
Users that are interested in rutokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Part-of-Speech Tagger for Russian language☆23Jul 29, 2020Updated 5 years ago
- Named entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке☆42Oct 10, 2025Updated 6 months ago
- Лемматизатор для русскоязычных текстов☆46Jun 4, 2020Updated 5 years ago
- 📚 A small collection of Russian literature 📚☆15Dec 9, 2022Updated 3 years ago
- Простой нормализатор текстов перед синтезом речи☆48May 13, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- T5-based (russian) text normalization☆27Jan 25, 2024Updated 2 years ago
- Simple Python package for breaking Russian words into syllables☆32Feb 20, 2020Updated 6 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.☆59Feb 27, 2021Updated 5 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- Russian GPT2 model☆62Jul 12, 2021Updated 4 years ago
- Data and Code for COLM 2025 paper "Retrieval-Augmented Generation with Conflicting Evidence"☆23Apr 18, 2025Updated last year
- [UNSUPPORTED] - please use https://github.com/kmike/pymorphy2. Russian and English morphology analyser (POS tagger + inflection engine) w…☆41Jul 23, 2015Updated 10 years ago
- Это прототип решения типа Agentic RAG (Retrieval-Augmented Generation) с данными из Jira, Confluence и Git.☆11Dec 4, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- SpaCy official Russian model proposal☆32Jan 24, 2021Updated 5 years ago
- Multilingual RAG benchmark.☆10Nov 22, 2024Updated last year
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Sep 23, 2021Updated 4 years ago
- Using transformers to generate Russian poetry☆36Aug 21, 2023Updated 2 years ago
- ☆17Apr 14, 2023Updated 3 years ago
- 学习vLLM,使用vLLM部署Qwen2-0.5B的模型,并使用docker部署。☆20Jun 22, 2024Updated last year
- Implementation of transformer for optical character recognition of russian words☆14Nov 25, 2023Updated 2 years ago
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- My own raytracer in one week ⚡☆31Feb 27, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ChatGPT Jailbreak promts☆15Mar 22, 2023Updated 3 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆46Mar 20, 2025Updated last year
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 6 years ago
- A tool that generates python code out of your GraphQL schema.☆16Nov 27, 2023Updated 2 years ago
- ☆14Aug 30, 2022Updated 3 years ago
- ☆16Oct 29, 2023Updated 2 years ago
- Проект языковой модели для проведения морфемного анализа, сегментации и токенизации слов русского языка.☆17Jan 10, 2025Updated last year
- NoORM (Not only ORM) - Python library that makes your database operations convenient and natural☆17Oct 27, 2025Updated 6 months ago
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆59Jul 9, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Solves basic Russian NLP tasks, API for lower level Natasha projects☆1,327Apr 13, 2026Updated 3 weeks ago
- Train punctuation and capitalization models for different languages☆26Apr 2, 2022Updated 4 years ago
- 🌙 Reduce eye strain when reading docs☆17Oct 1, 2018Updated 7 years ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆35Mar 2, 2025Updated last year
- Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs☆31Dec 15, 2014Updated 11 years ago
- ☆24Nov 3, 2024Updated last year
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago