Anwarvic/VAD_Benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Anwarvic/VAD_Benchmark)

Anwarvic / VAD_Benchmark

Benchmarking different VAD models on AVA-Speech dataset

☆19

Alternatives and similar repositories for VAD_Benchmark

Users that are interested in VAD_Benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TuZehai / Sheffield_Clarity_CEC1_Entry
View on GitHub
Implementation of Sheffield entry for Clarity enhancement challenge.
☆18Apr 19, 2022Updated 4 years ago
openmediatransport / libvmx
View on GitHub
VMX Codec
☆22Apr 9, 2026Updated 3 months ago
alvarobartt / covid-daily
View on GitHub
🦠 COVID-19 Daily Data from Worldometers with Python
☆13Feb 28, 2021Updated 5 years ago
openmediatransport / libomt
View on GitHub
A C wrapper for the libomnet library.
☆20Jun 2, 2026Updated last month
ImperialCollegeLondon / spear-tools
View on GitHub
SPEAR Challenge scripts and tools.
☆25Mar 17, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
freds0 / data_augmentation_for_asr
View on GitHub
A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
☆49Oct 15, 2021Updated 4 years ago
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
satyanamuduri / Speech-Enhancement-Using-GSC
View on GitHub
To Implement the Generalized Side Lobe Canceller with Fixed Beamformer,parallel blocking matrix and adaptive interference canceller achie…
☆29Oct 15, 2019Updated 6 years ago
CaA23187 / VAD-based-on-LSTM
View on GitHub
A LSTM for voice activity detection. In fact, this is a homework which I didn't expected.
☆13Dec 3, 2020Updated 5 years ago
BUTSpeechFIT / vae_dolphin
View on GitHub
☆10Jan 26, 2021Updated 5 years ago
carlthome / pmqd
View on GitHub
Perceived Music Quality Dataset
☆12Jul 1, 2024Updated 2 years ago
BUTSpeechFIT / cgmm_mvdr_online
View on GitHub
Implementation of CGMM-MVDR beamforming used for Clarity challenge
☆14Jan 14, 2022Updated 4 years ago
dr-pato / SSGD
View on GitHub
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆15Dec 22, 2022Updated 3 years ago
jdvala / zoom_audio_transcribe
View on GitHub
Zoom Audio Transcription offline
☆34Sep 30, 2020Updated 5 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
yinruiqing / fsmn
View on GitHub
Feedforward Sequential Memory Networks
☆18Aug 2, 2022Updated 3 years ago
DakeQQ / Voice-Activity-Detection-VAD-ONNX
View on GitHub
Utilizes ONNX Runtime for speech activity detection.
☆46Jun 25, 2026Updated 3 weeks ago
TeaPoly / warp-ctc-crf
View on GitHub
An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.
☆12Jul 5, 2021Updated 5 years ago
FrancoisGrondin / smpphat
View on GitHub
☆16Mar 29, 2022Updated 4 years ago
Yifei-ZHAO96 / STAM-pytorch
View on GitHub
Pytorch implementation of "spectro-temporal attention-based voice activity detection"
☆13Jun 4, 2024Updated 2 years ago
Okrio / FSPEN
View on GitHub
☆21Apr 27, 2024Updated 2 years ago
f0k / minimp3py
View on GitHub
Python bindings for minimp3
☆17Sep 11, 2023Updated 2 years ago
wqmsybpw / numerical_PDE
View on GitHub
偏微分方程数值解作业
☆14Aug 10, 2020Updated 5 years ago
Pliploop / GDRetriever
View on GitHub
Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…
☆19Sep 25, 2025Updated 9 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Takaaki-Saeki / ssl_speech_restoration_v2
View on GitHub
☆17Dec 18, 2023Updated 2 years ago
NjuHaoZhang / ConvLSTM-AE_VAD_ICME2017
View on GitHub
ConvLSTM-AE_VAD_ICME2017 (code reimplementation)
☆21Oct 10, 2020Updated 5 years ago
matt-graham / phd-thesis
View on GitHub
Auxiliary variable Markov chain Monte Carlo methods
☆10Oct 24, 2017Updated 8 years ago
lucacoma / NeuralBeamspaceDomainFilter
View on GitHub
Unofficial Implementation of "Liu, W., Li, A., Wang, X., Yuan, M., Chen, Y., Zheng, C., & Li, X. (2022). A Neural Beamspace-Domain Filter…
☆19Oct 21, 2022Updated 3 years ago
k2-fsa / sherpa-mlx
View on GitHub
sherpa with mlx
☆15Aug 2, 2025Updated 11 months ago
novonotes / efficient-spherical-harmonic-evaluation
View on GitHub
http://jcgt.org/published/0002/02/06/
☆16Dec 23, 2020Updated 5 years ago
cadia-lvl / punctuation-prediction
View on GitHub
Support tools for punctuation and boundary detection for ASR output.
☆55Dec 8, 2022Updated 3 years ago
desh2608 / gss
View on GitHub
A simple package for Guided source separation (GSS)
☆134May 20, 2024Updated 2 years ago
JupiterEthan / CRN-causal
View on GitHub
☆69Apr 29, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Waino / morfessor-emprune
View on GitHub
Morfessor EM+Prune
☆10Jul 22, 2020Updated 6 years ago
elevoctech / ESMB-corpus
View on GitHub
☆21Oct 7, 2021Updated 4 years ago
rxtan2 / AVSeT
View on GitHub
☆17Oct 2, 2023Updated 2 years ago
Xianchao-Wu / wenet-deep-sparse-conformer
View on GitHub
☆15Aug 25, 2022Updated 3 years ago
sp-uhh / stcn-nmf
View on GitHub
VAE and STCN with NMF for single-channel speech enhancement
☆15Mar 24, 2021Updated 5 years ago
Okrio / CRUSE
View on GitHub
a lightweight network for monaural speech enhancement
☆58Oct 12, 2023Updated 2 years ago
espnet / warp-ctc
View on GitHub
Pytorch Bindings for warp-ctc maintained by ESPnet
☆17Feb 20, 2021Updated 5 years ago