voithru / voice-activity-detectionView external linksLinks
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
☆160Oct 26, 2021Updated 4 years ago
Alternatives and similar repositories for voice-activity-detection
Users that are interested in voice-activity-detection are comparing it to the libraries listed below
Sorting:
- Wav2Vec2 finetune and inference code for IITP AI Grand Challenge☆36Feb 22, 2022Updated 3 years ago
- ☆21Feb 21, 2022Updated 3 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆151Jun 5, 2025Updated 8 months ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆88Sep 7, 2022Updated 3 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Aug 3, 2023Updated 2 years ago
- Pytorch implementation of "spectro-temporal attention-based voice activity detection"☆13Jun 4, 2024Updated last year
- Voice Activity Detection (VAD) using deep learning.☆204Oct 14, 2019Updated 6 years ago
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆27Mar 20, 2021Updated 4 years ago
- Lightweight CNN for Robust Voice Activity Detection☆20Jun 30, 2023Updated 2 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆371Mar 24, 2023Updated 2 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆42Feb 8, 2020Updated 6 years ago
- End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.☆10Jan 21, 2022Updated 4 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Aug 3, 2023Updated 2 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆65Nov 28, 2021Updated 4 years ago
- ☆27Oct 25, 2024Updated last year
- ☆17Apr 14, 2023Updated 2 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆138Jan 20, 2024Updated 2 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆868Jun 9, 2021Updated 4 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆381Mar 24, 2023Updated 2 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Sep 23, 2020Updated 5 years ago
- Went online decode demo☆31Apr 28, 2021Updated 4 years ago
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- A pytorch based end2end speech recognition system.☆114Jan 16, 2021Updated 5 years ago
- ☆40Aug 15, 2021Updated 4 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆43Nov 18, 2022Updated 3 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Aug 16, 2021Updated 4 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50May 19, 2021Updated 4 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆149Aug 25, 2023Updated 2 years ago
- Voice activity engine benchmark framework☆21Jan 14, 2026Updated 3 weeks ago