Russian text segmenter and tokenizer
☆18Mar 2, 2021Updated 5 years ago
Alternatives and similar repositories for rutokenizer
Users that are interested in rutokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Part-of-Speech Tagger for Russian language☆23Jul 29, 2020Updated 5 years ago
- Named entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке☆42Oct 10, 2025Updated 8 months ago
- Sharepoint REST Utilities C#☆15Nov 19, 2019Updated 6 years ago
- Simple neuroevolution AI example using pygame & NEAT python.☆11Oct 20, 2024Updated last year
- Лемматизатор для русскоязычных текстов☆46Jun 4, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Татьяна - система "умный дом" на базе Raspberry Pi☆11Jan 7, 2018Updated 8 years ago
- 📚 A small collection of Russian literature 📚☆15Dec 9, 2022Updated 3 years ago
- Грамматический Словарь Русского Языка (+ английский, японский, etc)☆78Aug 10, 2020Updated 5 years ago
- Код для файнтюна LM (rugpt, LLaMa, FRED T5) средствами transformers + deepspeed + LoRa☆14May 22, 2023Updated 3 years ago
- Простой нормализатор текстов перед синтезом речи☆48May 13, 2024Updated 2 years ago
- Генеративные текстовые модели☆14Sep 12, 2018Updated 7 years ago
- T5-based (russian) text normalization☆27Jan 25, 2024Updated 2 years ago
- Simple Python package for breaking Russian words into syllables☆32Feb 20, 2020Updated 6 years ago
- Soulgem Oven 4 Special Edition Edition☆27Oct 19, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- 🔌 Репозиторий игры Space Station 14 проекта Space Stories.☆13Jun 8, 2026Updated last week
- Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.☆59Feb 27, 2021Updated 5 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- Russian GPT2 model☆62Jul 12, 2021Updated 4 years ago
- V wrapper of Edubart's minicoro - A cross-platform coroutine library☆13Nov 26, 2022Updated 3 years ago
- Анализ данных и статистика в R☆18May 28, 2026Updated 2 weeks ago
- Ocy project cleaner☆13Feb 1, 2024Updated 2 years ago
- Data and Code for COLM 2025 paper "Retrieval-Augmented Generation with Conflicting Evidence"☆23Apr 18, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [UNSUPPORTED] - please use https://github.com/kmike/pymorphy2. Russian and English morphology analyser (POS tagger + inflection engine) w…☆41Jul 23, 2015Updated 10 years ago
- Это прототип решения типа Agentic RAG (Retrieval-Augmented Generation) с данными из Jira, Confluence и Git.☆11Dec 4, 2024Updated last year
- A collection of word lists in machine readable, web-native (.yml and .json) format☆30Jul 20, 2023Updated 2 years ago
- SpaCy official Russian model proposal☆32Jan 24, 2021Updated 5 years ago
- Multilingual RAG benchmark.☆11Nov 22, 2024Updated last year
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Sep 23, 2021Updated 4 years ago
- fast compression of short text messages☆14Oct 31, 2015Updated 10 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Using transformers to generate Russian poetry☆36Aug 21, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Corpus of Russian news articles collected from Lenta.Ru☆145Nov 19, 2022Updated 3 years ago
- A lightweight, thread-safe Rust library for managing system-wide hotkeys on Windows☆17May 17, 2025Updated last year
- My NLP datasets for Russian language☆390Feb 18, 2023Updated 3 years ago
- A primarily osu!b1815 2011 Bancho written in Go! Designed to work on every osu! client out there.☆11Dec 4, 2024Updated last year
- ☆17Apr 14, 2023Updated 3 years ago
- 学习vLLM,使用vLLM部署Qwen2-0.5B的模型,并使用docker部署。☆20Jun 22, 2024Updated last year
- Implementation of transformer for optical character recognition of russian words☆14Nov 25, 2023Updated 2 years ago