eellak / gsoc2021-audio-annotation-tool
Creation of a multi user audio first annotation tool - GSoC 2021
☆29Updated 2 years ago
Alternatives and similar repositories for gsoc2021-audio-annotation-tool:
Users that are interested in gsoc2021-audio-annotation-tool are comparing it to the libraries listed below
- ☆22Updated 3 years ago
- ☆40Updated 3 years ago
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Updated 2 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 7 months ago
- ☆32Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- with alignment learning and continuous wavelet transform☆20Updated 2 years ago
- Digital Speech Processing in PyTorch.☆14Updated 2 years ago
- ☆11Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆19Updated last year
- ☆17Updated 3 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆22Updated 3 years ago
- A handy dataset of noises for ASR☆20Updated 5 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- ☆12Updated 2 months ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆16Updated 2 years ago
- MFA acoustic model training based on Opencpop☆14Updated 2 years ago
- visual-text to speech☆14Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated last year
- ☆32Updated 4 years ago
- A library of speech gadgets.☆13Updated 2 years ago
- ☆41Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆16Updated 10 months ago