georgid / AlignmentDurationLinks
Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.
☆58Updated 5 years ago
Alternatives and similar repositories for AlignmentDuration
Users that are interested in AlignmentDuration are comparing it to the libraries listed below
Sorting:
- A complete training recipe for kaldi-based Automatic Lyrics Transcription.☆31Updated 3 years ago
- Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structur…☆93Updated 7 years ago
- DEPRECATED: Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation☆88Updated 6 months ago
- "Joint Detection and Classification of Singing Voice Melody Using Convolutional Recurrent Neural Networks"☆130Updated 5 years ago
- Python package implementing the TD-PSOLA algorithm for speech processing☆43Updated 8 years ago
- The official implementation of "TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music"☆41Updated 3 years ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)☆22Updated 3 years ago
- A python wrapper for REAPER☆81Updated 9 months ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Updated 5 years ago
- ☆15Updated 3 years ago
- pytorch implementation of JDCNet, singing voice detection and classification network☆53Updated 2 years ago
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆161Updated 3 years ago
- python pYIN☆91Updated 9 years ago
- Revisiting Singing Voice Detection : a Quantitative Review and the Future Outlook☆67Updated 2 years ago
- The source code of "A Streamlined Encoder/Decoder Architecture for Melody Extraction"☆73Updated 5 years ago
- Crowdsourced Audio Quality Evaluation Toolkit☆55Updated 2 years ago
- Pitch-shifting and time-stretching with TD-PSOLA☆86Updated 2 years ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 6 years ago
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS…☆86Updated 3 years ago
- Fast Python implementation of the Yin algorithm: a fundamental frequency estimator☆103Updated 3 years ago
- The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.☆126Updated 4 years ago
- Util code, issues, discussions☆29Updated 7 years ago
- Yin pitch estimator in PyTorch☆117Updated 3 years ago
- VOCANO: A note transcription framework for singing voice in polyphonic music☆71Updated 4 years ago
- DNN based singing voice synthesis☆17Updated 7 years ago
- Fully-Convolutional Network for Pitch Estimation of Speech Signals☆59Updated 2 years ago
- A system works on singing voice synthesis☆79Updated 2 years ago
- Pitch shifter using WSOLA and resampling implemented by Python3☆39Updated 8 years ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆84Updated this week
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆81Updated 4 years ago