bookbot-kids / g2p_id
g2p ID: Indonesian Grapheme-to-Phoneme Converter
☆15 · Updated last month
Alternatives and similar repositories for g2p_id:
Users who are interested in g2p_id compare it to the libraries listed below.
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆27 · Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆64 · Updated last year
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS ☆21 · Updated 2 years ago
- VietTTS: An Open-Source Vietnamese Text to Speech ☆23 · Updated last month
- Convert English text from written expressions into spoken forms ☆22 · Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 · Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 · Updated 4 years ago
- phone inventory library ☆16 · Updated last year
- English conversation corpus for conversational TTS. ☆21 · Updated last year
- Just another FastSpeech 2 but cleaner code :) ☆25 · Updated 6 months ago
- This is the M-AILABS Speech Dataset ☆37 · Updated last month
- ☆25 · Updated 5 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert ☆39 · Updated last year
- ☆56 · Updated 2 years ago
- Finetuning VITS Efficiently ☆32 · Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆73 · Updated last year
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆14 · Updated 2 years ago
- ☆19 · Updated 4 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆62 · Updated 10 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆39 · Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆48 · Updated 5 months ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis ☆29 · Updated 5 months ago
- ☆27 · Updated 2 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech ☆22 · Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆20 · Updated 10 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆25 · Updated 6 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations. ☆34 · Updated 6 months ago
- ☆28 · Updated last year
- ☆48 · Updated 2 months ago
- GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification ☆13 · Updated 4 years ago