MiniXC / phones
A collection of utilities for handling IPA phones.
☆25Updated last year
Related projects ⓘ
Alternatives and complementary repositories for phones
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆39Updated 3 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆35Updated last month
- ☆56Updated last year
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆16Updated 4 years ago
- ☆26Updated 3 years ago
- ☆32Updated 2 months ago
- Temporary anonymous version☆22Updated 8 months ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated last year
- ☆42Updated 2 years ago
- Just another FastSpeech 2 but cleaner code :)☆25Updated 4 months ago
- Prosodic Speech Segmentation with Transformers☆23Updated 8 months ago
- ☆22Updated 3 years ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆19Updated this week
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- Collection of scripts from mHuBERT-147.☆22Updated this week
- ☆17Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆42Updated 4 months ago
- ☆20Updated 6 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated 2 weeks ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- phone inventory library☆15Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 3 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆37Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆61Updated 8 months ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- ☆32Updated 2 months ago
- ☆12Updated 3 years ago