eellak / gsoc2021-audio-annotation-tool
Creation of a multi user audio first annotation tool - GSoC 2021
☆29Updated 2 years ago
Alternatives and similar repositories for gsoc2021-audio-annotation-tool:
Users that are interested in gsoc2021-audio-annotation-tool are comparing it to the libraries listed below
- ☆22Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 10 months ago
- ☆41Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆20Updated last year
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 3 years ago
- ☆33Updated 3 years ago
- Sequence alignement methods with helpers for PyTorch.☆24Updated 2 years ago
- with alignment learning and continuous wavelet transform☆20Updated 2 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 8 months ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆31Updated 8 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆23Updated 2 years ago
- ☆32Updated 3 years ago
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 2 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆75Updated last year
- A collection of papers related to speech model compression☆24Updated last year
- A toolset for easy formant extraction and visualization from wav files and TTS models☆30Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆21Updated last year
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆20Updated last year
- Just another FastSpeech 2 but cleaner code :)☆26Updated 9 months ago
- Singing Voice Speech modeling test☆35Updated 2 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Updated 2 years ago
- ☆17Updated 3 years ago
- Speechflow for emotion recognition related information decomposition☆10Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Updated 4 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated 2 years ago