georgid / AlignmentEvaluation
Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if token is word, phrase, note, section etc.) User for the evaluation of the MIREX Lyrics-to-audio challenge
☆18Updated 3 years ago
Related projects: ⓘ
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆20Updated 2 years ago
- ☆15Updated last year
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 3 years ago
- Multiple Fundamental Frequency Estimation☆26Updated 10 years ago
- ☆18Updated 5 years ago
- Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation☆21Updated 2 years ago
- Python package implementing the TD-PSOLA algorithm for speech processing☆42Updated 7 years ago
- ☆34Updated 5 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- A python implementation of a simple Unit Selection Text-to-Speech (TTS) synthesis system. It works with CMU-Arctic data by default☆10Updated 9 years ago
- DNN based singing voice synthesis☆17Updated 5 years ago
- Addressing Text-dependent Speaker Verification Using Singing Speech☆9Updated 5 years ago
- ☆26Updated 3 years ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 5 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆21Updated 3 years ago
- ☆18Updated 2 years ago
- Yin pitch estimator in PyTorch☆113Updated last year
- ☆22Updated last year
- Script for converting kaldi GMM/HMM models to HTK format☆11Updated 2 months ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Updated 5 years ago
- Hybrid speech synthesiser☆28Updated 5 years ago
- ☆11Updated last year
- ☆19Updated 5 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- Pulse Model vocoder☆41Updated 5 years ago
- Dynamic time warping (DTW) functions for specifically speech alignment.☆26Updated 4 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated 6 months ago
- using world vocoder to extract features and make data for training neural networks☆11Updated 6 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆17Updated last year