Train punctuation and capitalization models for different languages
☆26Apr 2, 2022Updated 3 years ago
Alternatives and similar repositories for multipunct
Users that are interested in multipunct are comparing it to the libraries listed below
Sorting:
- Convert MUSE from TensorFlow to PyTorch and ONNX☆11May 22, 2024Updated last year
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…☆23Apr 16, 2025Updated 11 months ago
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.☆93May 27, 2023Updated 2 years ago
- ☆58Jan 24, 2024Updated 2 years ago
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆59Jul 9, 2021Updated 4 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- A database-like benchmark of feature generation from time-series data☆13Nov 27, 2024Updated last year
- ☆21Apr 2, 2025Updated 11 months ago
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆124Mar 15, 2021Updated 5 years ago
- Git Hooks Tutorial.☆17Jul 6, 2022Updated 3 years ago
- AWD-LSTM language model trained on newspaper corpora with fast.ai☆27Apr 9, 2020Updated 5 years ago
- First place solution for Yandex.Algorithm 2018 (ML Track)☆21May 16, 2018Updated 7 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Reinforcement Learning Library.☆29Aug 16, 2022Updated 3 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- материалы курса по питону для студентов дпо-программы "компьютерная лингвистика" в НИУ ВШЭ (2020-2021)☆11Feb 21, 2022Updated 4 years ago
- The broad index of NLP resources for Eastern European languages. The best EEML 2021 project.☆19Jun 24, 2022Updated 3 years ago
- CraftML is a restful web service for easy pipeline creation without code.☆13Apr 18, 2021Updated 4 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Comparing quality and performance of NLP systems for Russian language☆51Jul 24, 2023Updated 2 years ago
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 6 years ago
- ☆23Aug 26, 2024Updated last year
- library for filling in missing values using artificial intelligence methods☆18Jan 8, 2023Updated 3 years ago
- Code for "ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer"☆15Jul 17, 2024Updated last year
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆157May 22, 2024Updated last year
- ☆15Sep 15, 2022Updated 3 years ago
- Top ML papers of the week.☆46Updated this week
- EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer☆13Mar 23, 2025Updated 11 months ago
- Make GNN easy to start with☆134Mar 10, 2026Updated last week
- Repository containing our datasets for HTR (handwritten text recognition) task.☆27Sep 8, 2022Updated 3 years ago
- Question answering on russian with XLMRobertaLarge as a service☆21Nov 6, 2021Updated 4 years ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"☆12Jun 7, 2021Updated 4 years ago
- Data from "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification" paper☆14Apr 3, 2025Updated 11 months ago
- ML Course created for Bauman Moscow State Technical University☆65Aug 31, 2022Updated 3 years ago
- this repository is created to accumulate all LaTeX templates needed at Skoltech☆20Nov 27, 2018Updated 7 years ago
- REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.☆51Sep 5, 2021Updated 4 years ago
- Compact high quality word embeddings for Russian language☆217Jul 24, 2023Updated 2 years ago
- Демонстрация структуры ml проекта☆11Oct 12, 2022Updated 3 years ago