ssmlkl / MnTTS2
This is the experimental description of MnTTS2.
☆11 Updated last year
Alternatives and similar repositories for MnTTS2
Users who are interested in MnTTS2 are comparing it to the libraries listed below.
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ☆20 Updated 2 years ago
- ☆80 Updated last year
- Phoneme segmentation using pre-trained speech models ☆55 Updated 2 years ago
- ☆24 Updated last year
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024 ☆23 Updated 8 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert ☆40 Updated 2 years ago
- ☆32 Updated 2 years ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels ☆39 Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆17 Updated 10 months ago
- ☆25 Updated 3 years ago
- Multilingual speech aligner ☆75 Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024) ☆25 Updated last year
- Goodness of Pronunciation algorithm using PyKaldi ☆17 Updated 3 years ago
- Speech samples and code of BEdit-TTS ☆33 Updated last year
- Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat… ☆33 Updated last year
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS ☆24 Updated 3 years ago
- Crowdsourced and Automatic Speech Prominence Estimation ☆22 Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆32 Updated 2 years ago
- 56 languages, 1 model multilingual ASR ☆25 Updated 4 years ago
- ☆22 Updated 11 months ago
- ☆26 Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 Updated 2 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition) ☆33 Updated 2 years ago
- English conversation corpus for conversational TTS. ☆21 Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 Updated 2 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis. ☆88 Updated 3 years ago
- ☆19 Updated 10 months ago
- ☆11 Updated last year
- Phoneme alignment representation compatible with multiple forced aligners ☆21 Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆76 Updated 2 years ago