Train punctuation and capitalization models for different languages
☆26Apr 2, 2022Updated 3 years ago
Alternatives and similar repositories for multipunct
Users that are interested in multipunct are comparing it to the libraries listed below
Sorting:
- Convert MUSE from TensorFlow to PyTorch and ONNX☆11May 22, 2024Updated last year
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆11Apr 5, 2022Updated 3 years ago
- Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…☆23Apr 16, 2025Updated 10 months ago
- ☆13Aug 7, 2021Updated 4 years ago
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.☆93May 27, 2023Updated 2 years ago
- AWD-LSTM language model trained on newspaper corpora with fast.ai☆27Apr 9, 2020Updated 5 years ago
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆59Jul 9, 2021Updated 4 years ago
- Reinforcement Learning Library.☆29Aug 16, 2022Updated 3 years ago
- A database-like benchmark of feature generation from time-series data☆13Nov 27, 2024Updated last year
- CraftML is a restful web service for easy pipeline creation without code.☆13Apr 18, 2021Updated 4 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- ☆58Jan 24, 2024Updated 2 years ago
- ☆21Apr 2, 2025Updated 10 months ago
- The broad index of NLP resources for Eastern European languages. The best EEML 2021 project.☆19Jun 24, 2022Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- library for filling in missing values using artificial intelligence methods☆18Jan 8, 2023Updated 3 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Git Hooks Tutorial.☆17Jul 6, 2022Updated 3 years ago
- ☆22Aug 26, 2024Updated last year
- Development of a prototype engine for searching for goods on the tender procurement portal☆27Oct 25, 2022Updated 3 years ago
- T5-based (russian) text normalization☆25Jan 25, 2024Updated 2 years ago
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆123Mar 15, 2021Updated 4 years ago
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 6 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆23Jul 21, 2023Updated 2 years ago
- Comparing quality and performance of NLP systems for Russian language☆51Jul 24, 2023Updated 2 years ago
- Fine-tuned Multilingual BERT and Multilingual USE for sentiment analysis in Russian. RuReviews, RuSentiment, Kaggle Russian News Dataset,…☆51Feb 16, 2021Updated 5 years ago
- Небольшие авторские книги / учебные пособия / инструкции☆25Feb 14, 2025Updated last year
- ☆26Feb 20, 2026Updated last week
- Question answering on russian with XLMRobertaLarge as a service☆21Nov 6, 2021Updated 4 years ago
- First place solution for Yandex.Algorithm 2018 (ML Track)☆21May 16, 2018Updated 7 years ago
- ML Course created for Bauman Moscow State Technical University☆65Aug 31, 2022Updated 3 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆157May 22, 2024Updated last year
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆165Dec 8, 2025Updated 2 months ago
- Make GNN easy to start with☆134Jan 30, 2026Updated last month
- Pipeline for training NER models using PyTorch.☆56Jul 19, 2022Updated 3 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆105May 13, 2021Updated 4 years ago
- Unofficial implementation of QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition.☆65Oct 15, 2022Updated 3 years ago
- Workshop on Learning and Applying Large Language Models for Social Science Research☆81Dec 8, 2025Updated 2 months ago