Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
☆141Aug 3, 2023Updated 2 years ago
Alternatives and similar repositories for GPV
Users that are interested in GPV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The codebase for Data-driven general-purpose voice activity detection.☆93Aug 3, 2023Updated 2 years ago
- Materials of public talks given By SJTU X-LANCE members☆14Dec 3, 2022Updated 3 years ago
- Repo for our pooling approach on the DCASE2018 task4☆15Jul 6, 2023Updated 2 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Mar 3, 2020Updated 6 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆53May 15, 2025Updated 11 months ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆869Jun 9, 2021Updated 4 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆370Mar 24, 2023Updated 3 years ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆159Oct 26, 2021Updated 4 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆152Jun 5, 2025Updated 10 months ago
- Benchmark popular audio i/o packages☆151Dec 19, 2023Updated 2 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆42Feb 8, 2020Updated 6 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 3 years ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆88Sep 7, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- Speech enhancement using mimic loss☆16Oct 25, 2019Updated 6 years ago
- Python loaders for many Real Room Impulse Response databases☆96Sep 30, 2024Updated last year
- Voice Activity Detection (VAD) using deep learning.☆204Oct 14, 2019Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- End-to-End Neural Diarization☆430Aug 30, 2021Updated 4 years ago
- A library for speech data augmentation in time-domain☆685Aug 30, 2021Updated 4 years ago
- Diarization scoring tools.☆263Apr 8, 2026Updated last week
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,859Jul 22, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Speech Dereverberation using Fully Convolutional Networks☆77Aug 18, 2020Updated 5 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,048Jul 5, 2023Updated 2 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Apr 8, 2024Updated 2 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆523Feb 17, 2022Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆416Nov 20, 2025Updated 4 months ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆208Dec 8, 2022Updated 3 years ago
- ☆76Oct 25, 2021Updated 4 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆100Apr 20, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆125Apr 8, 2022Updated 4 years ago
- ☆557Jun 11, 2021Updated 4 years ago
- Python library for Room Impulse Response (RIR) simulation with GPU acceleration☆589Jul 18, 2025Updated 8 months ago
- Tools for Speech Enhancement integrated with Kaldi☆430Jul 6, 2023Updated 2 years ago
- Big Impulse Response Dataset☆156Oct 19, 2022Updated 3 years ago
- MagicData-RAMC Dataset and Baseline☆59Sep 13, 2022Updated 3 years ago
- Repo associated to the DESED dataset, download and creation of data☆150Jul 16, 2024Updated last year