suralmasha / RuTranscript
Russian phonetic transcription
☆9Updated last year
Alternatives and similar repositories for RuTranscript:
Users interested in RuTranscript are comparing it to the libraries listed below
- A repository for trainable multi-speaker TTS☆14Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Neural model for prediction of stress position in Russian words☆11Updated last year
- ☆12Updated 3 weeks ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆11Updated last year
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆17Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated 11 months ago
- Forced alignment decoder for Whisper.☆14Updated 11 months ago
- ☆9Updated 4 months ago
- ☆12Updated 2 years ago
- Evaluation of STT models for the German language☆15Updated 3 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- ☆11Updated 3 years ago
- ☆11Updated last year
- ☆13Updated 2 years ago
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 2 years ago
- Normalize Text in Russian☆26Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆12Updated 3 months ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆12Updated last month
- ☆13Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- A semi-supervised sequence-to-sequence ASR☆10Updated 2 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 6 months ago
- Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle Vehicle Cabin Corpus and Attention-based Method" in ICASSP 20…☆9Updated 10 months ago