groadabike / Kaldi-Dsing-task
DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.
☆20Updated 2 years ago
Related projects: ⓘ
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Updated 2 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 3 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆21Updated 3 years ago
- Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation☆21Updated 2 years ago
- ☆22Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated 6 months ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 4 years ago
- ☆15Updated last year
- ☆18Updated 5 years ago
- ☆18Updated 2 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 4 years ago
- Prosodic Speech Segmentation with Transformers☆22Updated 6 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- ☆15Updated 3 years ago
- Paderbox: A collection of utilities for audio / speech processing☆37Updated 3 months ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Updated 3 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆32Updated last year
- A collection of papers related to speech model compression☆24Updated last year
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated last month
- Addressing the confounds of accompaniments in singer identification☆18Updated 4 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 3 years ago
- PyTorch implementation of Continuous Speech Separation☆13Updated last year
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆17Updated last year
- Deep Speech Distances PyTorch☆27Updated 2 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated 11 months ago
- ☆26Updated 3 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Updated 5 months ago
- ☆13Updated 2 years ago