bigcash/awesome-vad

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bigcash/awesome-vad)

bigcash / awesome-vad

A curated list of awesome voice activity detection

☆75

Alternatives and similar repositories for awesome-vad

Users that are interested in awesome-vad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
hitz-zentroa / whisper-lm
View on GitHub
Add n-gram and large language model (LLM) support to Whisper models.
☆43May 6, 2025Updated last year
Sreyan88 / LipGER
View on GitHub
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19Jul 16, 2024Updated 2 years ago
dense-analysis / vim-speech
View on GitHub
Vim Speech Recognition Experiments
☆20May 30, 2025Updated last year
artie-inc / artie-bias-corpus
View on GitHub
Artie Bias Corpus: an audio corpus + code for detecting demographic bias
☆20Jul 21, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
daniel03c1 / NAS_VAD
View on GitHub
☆26Oct 25, 2024Updated last year
amritkromana / disfluency_detection_from_audio
View on GitHub
☆35Aug 22, 2024Updated last year
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
nmstoker / SimpleSpeechLoop
View on GitHub
A very basic demonstration connecting speech recognition and text-to-speech
☆20May 3, 2020Updated 6 years ago
HLasse / multidiagnosis-speech
View on GitHub
☆10Jun 23, 2023Updated 3 years ago
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
speechio / asr-noises
View on GitHub
A handy dataset of noises for ASR
☆22May 29, 2019Updated 7 years ago
zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion
View on GitHub
This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…
☆21Sep 18, 2023Updated 2 years ago
Diamondfan / Child-ASR-Paper
View on GitHub
A list of papers for child ASR
☆54Oct 8, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
asteroid-team / Libri_VAD
View on GitHub
Script to generate VAD dataset used in Asteroid recipe
☆21Sep 30, 2021Updated 4 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
revdotcom / words2num
View on GitHub
Convert words to numbers
☆21Apr 13, 2022Updated 4 years ago
SilvrDuck / AccentedSpeechRecognition
View on GitHub
Experiments on speech recognition robustness to accents and dialects
☆12Apr 2, 2019Updated 7 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
sajadalipour7 / Persian-Grapheme-To-Phoneme-With-Transformer
View on GitHub
Persian Grapheme To Phoneme with Transformer in Pytorch
☆11Sep 21, 2023Updated 2 years ago
Yaoming95 / UniPunc
View on GitHub
The case study and multilingfual performance of ICASSP submission
☆24Sep 24, 2022Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
mikex86 / DeepSpeech-Java-Bindings
View on GitHub
Java Bindings for the C++ library DeepSpeech
☆10Jun 4, 2020Updated 6 years ago
smalekz / PCVC
View on GitHub
Persian Consonant Vowel Combination (PCVC) Speech Dataset
☆21Apr 22, 2025Updated last year
MTG / Podcastmix
View on GitHub
PodcastMix A dataset for separating music and speech in podcasts.
☆44Aug 20, 2024Updated last year
Okrio / FSPEN
View on GitHub
☆21Apr 27, 2024Updated 2 years ago
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
daihuangyu / speex_aec_kf
View on GitHub
speex aec kalman filter
☆15Mar 17, 2024Updated 2 years ago
bagustris / ssl-ser
View on GitHub
Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"
☆10Mar 15, 2023Updated 3 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
Mrunal-G / Casual-turn-taking-and-backchannel-prediction
View on GitHub
☆16Jun 25, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sarahjuan / iban
View on GitHub
☆14Jun 12, 2015Updated 11 years ago
Picovoice / cobra
View on GitHub
On-device voice activity detection (VAD) powered by deep learning
☆268Updated this week
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
VITA-Group / Audio-Lottery
View on GitHub
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32Apr 8, 2022Updated 4 years ago
poleval / 2021-punctuation-restoration
View on GitHub
PolEval 2021 Task 1
☆15Jun 28, 2022Updated 4 years ago
laurensw75 / docker-Kaldi-NL
View on GitHub
Docker for building an environment for Dutch online and offline ASR.
☆12Feb 2, 2021Updated 5 years ago
introlab / uimvdr
View on GitHub
☆13Oct 11, 2024Updated last year