schufo / plla-tisvs
Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation
☆21Updated 2 years ago
Related projects: ⓘ
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆21Updated 3 years ago
- Project for MIDI to Audio Synthesis☆19Updated last year
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated 11 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆22Updated 4 months ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Updated 2 years ago
- ☆18Updated 5 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆20Updated 2 years ago
- ☆26Updated 3 years ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆25Updated 4 months ago
- ☆29Updated this week
- ☆31Updated 2 years ago
- ☆15Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆29Updated 8 months ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Updated 8 months ago
- Reproducible Subjective Evaluation☆57Updated 6 months ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆17Updated last year
- Implementation of CREPE Pitch tracker with PyTorch☆19Updated 4 years ago
- ☆25Updated this week
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…☆17Updated 4 months ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated last month
- 60k hours of phoneme-aligned audio from audio books☆18Updated last month
- ☆18Updated 2 years ago
- ☆41Updated last year
- ☆15Updated 3 years ago
- Deep Performer: Score-to-audio music performance synthesis☆41Updated last year
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆19Updated 8 months ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆29Updated 2 years ago
- Deep Speech Distances PyTorch☆27Updated 2 years ago