dafyddg / RFA
Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low-frequency spectrum and spectrogram.
☆14 · Updated last year
Related projects
Alternatives and complementary repositories for RFA
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering ☆18 · Updated last year
- Simple tool for speech dataset augmentation for modeling various prosodies. ☆14 · Updated 3 years ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt… ☆16 · Updated 4 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆10 · Updated 11 months ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es… ☆18 · Updated 3 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech ☆22 · Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆13 · Updated last month
- A toolset for easy formant extraction and visualization from wav files and TTS models ☆30 · Updated 2 years ago
- Code for the paper "Multi-Band Masking for Waveform-Based Singing Voice Separation", accepted at EUSIPCO 2022 ☆15 · Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Updated last year
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram. ☆21 · Updated 3 years ago
- Unofficial PyTorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC… ☆15 · Updated last year
- 60k hours of phoneme-aligned audio from audio books ☆18 · Updated 3 months ago
- A simple command line tool to calculate WER for ASR. ☆13 · Updated last month
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform ☆16 · Updated 6 months ago
- ICASSP 2021 accepted papers on voice conversion (VC) ☆18 · Updated 3 years ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023 ☆27 · Updated last year
- Reimplementation of Miipher ☆20 · Updated last year
- Repository for reproducing the results of the journal article "Self-supervised learning for Speech Emotion Recognition" ☆9 · Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/ ☆36 · Updated last year
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024) ☆12 · Updated 2 months ago