Lesha17 / Punctuation
Training BERT for punctuation task
☆10Updated 4 years ago
Alternatives and similar repositories for Punctuation:
Users that are interested in Punctuation are comparing it to the libraries listed below
- ☆13Updated 2 years ago
- ☆13Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Train punctuation and capitalization models for different languages☆24Updated 2 years ago
- T5-based (russian) text normalization☆20Updated last year
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Updated 8 months ago
- Smart Language Model☆46Updated 2 years ago
- ☆11Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆15Updated 3 years ago
- Punctuation and casing restoration for the Russian Language (BERT-based)☆20Updated 3 years ago
- ☆11Updated 2 years ago
- ☆23Updated 3 years ago
- ANYKS Spell-Checker☆19Updated 2 years ago
- ☆23Updated 3 years ago
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Updated last year
- ☆18Updated 3 months ago
- Repository with illustrations for cft-contest-2018☆12Updated 6 years ago
- Normalize Text in Russian☆26Updated last year
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆33Updated 7 months ago
- Speech analytics package for call-center☆23Updated 4 years ago
- Русско-Английский вокодер на GAN☆17Updated 3 years ago
- Russian open TTS dataset☆12Updated 5 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 4 years ago
- Python клиент API распознавания и синтеза речи Облака ЦРТ☆11Updated 2 years ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆28Updated 6 months ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆47Updated last month
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 3 years ago
- FusionBrain Challenge 2.0: creating multimodal multitask model☆16Updated 2 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Updated 5 months ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆23Updated last year