deezer / MultilingualLyricsToAudioAlignment
DALI dataset splits used to train the models presented in the paper "Multilingual lyrics-to-audio alignment" (ISMIR 2020).
☆13 Updated 3 years ago
Related projects:
- Codebase for 'A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance', ICASSP 2024 ☆11 Updated 7 months ago
- Lyrics-to-audio alignment system. Based on machine learning algorithms: Hidden Markov Models with Viterbi forced alignment. The alignme… ☆56 Updated 4 years ago
- MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage ☆32 Updated 2 months ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106 ☆46 Updated 3 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆22 Updated 4 months ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling ☆37 Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆67 Updated last year
- Timbre Transfer using Denoising Diffusion Implicit Models (ISMIR 2023) ☆26 Updated 6 months ago
- Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation ☆72 Updated 10 months ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset" ☆52 Updated last year
- Implementation of the paper "End-to-end lyrics alignment for polyphonic music using an audio-to-character recognition model" ☆15 Updated last year
- This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBER… ☆26 Updated 2 years ago
- Deep Performer: Score-to-audio music performance synthesis ☆41 Updated last year
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok… ☆18 Updated 3 years ago
- Unofficial implementation of NANSY++ in PyTorch Lightning ☆46 Updated 6 months ago
- Prosody and Pronunciation Modification Network ☆36 Updated last month
- Official implementation of DualCycleGAN for nonparallel audio super resolution ☆49 Updated last year
- Frechet Audio Distance evaluation in PyTorch ☆34 Updated last year
- Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513 ☆63 Updated last year
- The DJ Mix Dataset ☆11 Updated 2 years ago
- The official implementation of EmoSphere-TTS ☆58 Updated last month
- A Chinese version of A Neural Parametric Singing Synthesizer ☆12 Updated 2 years ago