Lesha17 / Punctuation
Training BERT for punctuation task
☆10Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Punctuation
- ☆13Updated 3 years ago
- ☆13Updated last year
- Train punctuation and capitalization models for different languages☆24Updated 2 years ago
- T5-based (russian) text normalization☆19Updated 10 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 9 months ago
- Smart Language Model☆47Updated last year
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Updated 4 months ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆26Updated 2 months ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆30Updated 3 months ago
- MOdel ResOurCe COnsumption. Evaluate Russian SuperGLUE models performance: inference speed, RAM usage. Reproducible scores using Docker☆21Updated 2 years ago
- ☆11Updated 3 years ago
- Repository with illustrations for cft-contest-2018☆12Updated 6 years ago
- Word Embeddings for Low Resource Languages: The Case of Buryat☆10Updated last year
- ☆21Updated 3 years ago
- ☆17Updated 2 months ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆15Updated 3 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆46Updated 2 months ago
- Normalize Text in Russian☆24Updated last year
- Русско-Английский вокодер на GAN☆17Updated 3 years ago
- ☆11Updated last year
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆17Updated last year
- Probing suite for evaluation of Russian embedding and language models☆32Updated last month
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆22Updated last year
- Punctuation and casing restoration for the Russian Language (BERT-based)☆20Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Python клиент API распознавания и синтеза речи Облака ЦРТ☆11Updated last year
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆19Updated 5 years ago
- Russian phonetical transcription☆9Updated 11 months ago
- Нейронная сеть для восстановления пунктуации на русском языке.☆20Updated 2 years ago