eellak / gsoc2021-audio-annotation-toolLinks
Creation of a multi user audio first annotation tool - GSoC 2021
☆29Updated 2 years ago
Alternatives and similar repositories for gsoc2021-audio-annotation-tool
Users that are interested in gsoc2021-audio-annotation-tool are comparing it to the libraries listed below
Sorting:
- ☆22Updated 4 years ago
 - DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 3 years ago
 - ☆32Updated 3 years ago
 - wake-up word emotion recognition [APSIPA 2022]☆17Updated 2 years ago
 - A handy dataset of noises for ASR☆22Updated 6 years ago
 - Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 3 years ago
 - NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
 - A library of speech gadgets.☆14Updated 3 years ago
 - Ultrafast GAN based Vocoder for Text to Speech☆50Updated 3 years ago
 - ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
 - Filtering and Noise Adding Tool☆29Updated 3 years ago
 - ☆14Updated 3 years ago
 - This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
 - [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Updated 3 years ago
 - PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 5 years ago
 - ☆41Updated 2 years ago
 - System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
 - This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Updated 3 years ago
 - Pronunciation-assisted Subword Modeling☆31Updated 6 years ago
 - Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation☆23Updated 3 years ago
 - Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated 2 years ago
 - Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 5 years ago
 - A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Updated 4 years ago
 - Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
 - ☆15Updated 4 years ago
 - Open Source Speech/Text Data on AI☆18Updated 3 years ago
 - SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
 - A python implementation of a simple Unit Selection Text-to-Speech (TTS) synthesis system. It works with CMU-Arctic data by default☆11Updated 10 years ago
 - A toolset for easy formant extraction and visualization from wav files and TTS models☆31Updated 3 years ago
 - ☆56Updated 2 years ago