zhenghuatan / rVADfastView external linksLinks
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
☆151Jun 5, 2025Updated 8 months ago
Alternatives and similar repositories for rVADfast
Users that are interested in rVADfast are comparing it to the libraries listed below
Sorting:
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆138Jan 20, 2024Updated 2 years ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆160Oct 26, 2021Updated 4 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- ☆46Jun 6, 2021Updated 4 years ago
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆54May 25, 2022Updated 3 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆149Aug 25, 2023Updated 2 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- Utilities for resampling and filtering audio data☆47Jan 9, 2020Updated 6 years ago
- A personal toolkit for single/multi-channel speech recognition & enhancement & separation.☆145Jul 6, 2023Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Apr 8, 2024Updated last year
- A library for speech data augmentation in time-domain☆682Aug 30, 2021Updated 4 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- Collection of audio-focused loss functions in PyTorch☆846Jul 30, 2024Updated last year
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Aug 3, 2023Updated 2 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 2 years ago
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- ☆53May 15, 2025Updated 9 months ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids"☆48Jun 3, 2020Updated 5 years ago
- ☆209Dec 4, 2023Updated 2 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,034Jul 5, 2023Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- A system works on singing voice synthesis☆79Jan 11, 2023Updated 3 years ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆88Sep 7, 2022Updated 3 years ago
- An open source dataset for source separation☆472Feb 9, 2024Updated 2 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆22Dec 8, 2022Updated 3 years ago
- ☆17Apr 3, 2022Updated 3 years ago
- Python wrapper for kaldi's arpa2fst☆37Aug 27, 2025Updated 5 months ago
- Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.☆140Sep 25, 2024Updated last year
- An online speech recognition extension toolkit of Kaldi☆56Jun 23, 2021Updated 4 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,135Nov 24, 2025Updated 2 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year