dafyddg / RFA
Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectrum and spectrogram.
☆14 Updated last year
Related projects
Alternatives and complementary repositories for RFA
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es… ☆18 Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆13 Updated last month
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering ☆18 Updated last year
- Simple tool for speech dataset augmentation for modeling various prosodies. ☆14 Updated 3 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech ☆22 Updated 2 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆10 Updated 11 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 Updated 2 years ago
- Sing any popular song with your voice ☆10 Updated 2 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram. ☆21 Updated 3 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", accepted at EUSIPCO 2022 ☆15 Updated 2 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models ☆30 Updated 2 years ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023 ☆27 Updated last year
- Synthesized singing voice demos of WeSinger 2 paper. ☆27 Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation ☆13 Updated 6 months ago
- with alignment learning and continuous wavelet transform ☆19 Updated 2 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024) ☆12 Updated 2 months ago
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform ☆16 Updated 5 months ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt… ☆16 Updated 4 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 Updated last year
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models ☆35 Updated 3 weeks ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023 ☆11 Updated 5 months ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 Updated last year