Train punctuation and capitalization models for different languages
☆26Apr 2, 2022Updated 4 years ago
Alternatives and similar repositories for multipunct
Users that are interested in multipunct are comparing it to the libraries listed below.
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- This repository measures the quality of Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt on popular English-language benchmarks: MGSM, MATH, HumanE…☆23Apr 16, 2025Updated last year
- ☆58Jan 24, 2024Updated 2 years ago
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆59Jul 9, 2021Updated 4 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- A project for converting numbers written in text form in Russian.☆11Apr 5, 2022Updated 4 years ago
- Git Hooks Tutorial.☆17Jul 6, 2022Updated 3 years ago
- Links to Russian corpora + Python functions for loading and parsing☆311Apr 21, 2026Updated last week
- AWD-LSTM language model trained on newspaper corpora with fast.ai☆27Apr 9, 2020Updated 6 years ago
- First place solution for Yandex.Algorithm 2018 (ML Track)☆21May 16, 2018Updated 7 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Reinforcement Learning Library.☆29Aug 16, 2022Updated 3 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Python course materials for students of the continuing professional education program "Computational Linguistics" at HSE University (2020-2021)☆11Feb 21, 2022Updated 4 years ago
- The broad index of NLP resources for Eastern European languages. The best EEML 2021 project.☆19Jun 24, 2022Updated 3 years ago
- CraftML is a restful web service for easy pipeline creation without code.☆13Apr 18, 2021Updated 5 years ago
- ☆17Apr 14, 2023Updated 3 years ago
- Comparing quality and performance of NLP systems for Russian language☆50Jul 24, 2023Updated 2 years ago
- Utilities to work with Church Slavonic language☆16May 8, 2018Updated 7 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆23Jul 21, 2023Updated 2 years ago
- ☆23Aug 26, 2024Updated last year
- ☆22Jun 10, 2025Updated 10 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆166Dec 8, 2025Updated 4 months ago
- Pipeline for training NER models using PyTorch.☆55Jul 19, 2022Updated 3 years ago
- library for filling in missing values using artificial intelligence methods☆18Jan 8, 2023Updated 3 years ago
- Code for "ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer"☆16Jul 17, 2024Updated last year
- ☆15Sep 15, 2022Updated 3 years ago
- Top ML papers of the week.☆47Updated this week
- Unofficial implementation of QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition.☆64Oct 15, 2022Updated 3 years ago
- EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer☆13Mar 23, 2025Updated last year
- Make GNN easy to start with☆134Mar 10, 2026Updated last month
- Question answering in Russian with XLMRobertaLarge as a service☆21Nov 6, 2021Updated 4 years ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"☆12Jun 7, 2021Updated 4 years ago
- Data from "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification" paper☆14Apr 3, 2025Updated last year
- ML Course created for Bauman Moscow State Technical University☆65Aug 31, 2022Updated 3 years ago
- Code for the MTEB leaderboard☆30Feb 4, 2025Updated last year
- this repository is created to accumulate all LaTeX templates needed at Skoltech☆20Nov 27, 2018Updated 7 years ago
- REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.☆51Sep 5, 2021Updated 4 years ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Sep 23, 2021Updated 4 years ago