F-Tag/python-vad

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/F-Tag/python-vad)

F-Tag / python-vad

py-webrtcvad wrapper for trimming speech clips

☆48

Alternatives and similar repositories for python-vad

Users that are interested in python-vad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chimechallenge / chime5-synchronisation
View on GitHub
CHiME-5 Baseline Array Synchronisation
☆12Sep 24, 2018Updated 7 years ago
arief25ramadhan / sound-source-localization
View on GitHub
Four neural network architectures to classify sound source direction
☆11Oct 3, 2020Updated 5 years ago
aispeech-lab / WASE
View on GitHub
PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…
☆27Jan 11, 2022Updated 4 years ago
bond005 / vad
View on GitHub
Various algorithms for voice activity detection
☆22Jan 31, 2017Updated 9 years ago
JiJiJiang / ASV-Anti-Spoofing-DADA
View on GitHub
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
☆19Updated this week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
kunaljathal / VAD
View on GitHub
Voice Activity Detection System
☆21Jun 9, 2015Updated 11 years ago
TUIlmenauAMS / FilterBanks_FastPythonImplementation
View on GitHub
Filter Banks, Fast Python Implementation
☆42Jul 9, 2022Updated 4 years ago
mwv / vad
View on GitHub
Voice Activity Detector
☆74Mar 7, 2026Updated 4 months ago
daanzu / kaldi-fork-active-grammar
View on GitHub
☆10Updated this week
npuichigo / tarzan
View on GitHub
High-level API for tar-based dataset
☆12Feb 3, 2024Updated 2 years ago
Mikxox / EnCodec_Trainer
View on GitHub
☆67Apr 3, 2023Updated 3 years ago
marsbroshok / VAD-python
View on GitHub
Voice Activity Detector in Python
☆481Nov 17, 2020Updated 5 years ago
ano-demo / AdvAttacksASVspoof
View on GitHub
This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification".
☆42Mar 9, 2023Updated 3 years ago
anjandeepsahni / automatic_speech_recognition
View on GitHub
Speech to text transcription using RNN (Listen, Attend and Spell).
☆11Aug 23, 2019Updated 6 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
google / embedding-tests
View on GitHub
☆17Dec 13, 2019Updated 6 years ago
ishaaniwani / GCC-PHAT-SSL
View on GitHub
MATLAB Simulation Framework For Basic Sound Source Localization Using the GCC PHAT Algorithm
☆23Jun 25, 2019Updated 7 years ago
npuichigo / blazing-fast-io-tutorial
View on GitHub
Blazing fast data loading with HuggingFace Dataset and Ray Data
☆15Jan 12, 2024Updated 2 years ago
pseeth / torch-stft
View on GitHub
An STFT/iSTFT for PyTorch.
☆372Oct 31, 2023Updated 2 years ago
chenllliang / CTDNN
View on GitHub
MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition
☆11Dec 4, 2021Updated 4 years ago
jewhoguy / SRP-PHAT
View on GitHub
Sound source localization using SRP-PHAT
☆27Feb 17, 2019Updated 7 years ago
ghunkins / Binaural-Source-Localization-CNN
View on GitHub
A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…
☆10Dec 16, 2017Updated 8 years ago
rishikksh20 / PPSpeech
View on GitHub
PPSpeech: Phrase based Parallel End-to-End TTS System
☆35Aug 31, 2020Updated 5 years ago
fgnt / sms_wsj
View on GitHub
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
☆131Jun 7, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kakaobrain / g2pm
View on GitHub
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
☆367Dec 24, 2021Updated 4 years ago
prml-lab-speech-team / demo
View on GitHub
☆26Aug 8, 2024Updated last year
GWLee0524 / AMTL
View on GitHub
Asymmetric Multi-Task Learning code, If you want to use it, please let me know and cite AMTL paper
☆11Aug 3, 2016Updated 9 years ago
rajivpoddar / logmmse
View on GitHub
LogMMSE speech enhancement/noise reduction
☆89Apr 1, 2020Updated 6 years ago
georgepar / kaldi-grpc-server
View on GitHub
Deploy Kaldi models using grpc for bidirectional streaming.
☆17Sep 30, 2024Updated last year
IDRnD / antispoofing-features
View on GitHub
Code for the paper "Bag of features for voice anti-spoofing"
☆13Jul 6, 2023Updated 3 years ago
espnet / icassp2020-tts
View on GitHub
ESPnet-TTS Audio Sample HP
☆21Oct 25, 2019Updated 6 years ago
kyutai-labs / yomikomi
View on GitHub
A small rust-based data loader
☆36Updated this week
yuanx520 / chinese_image_captioning
View on GitHub
Image Captioning in Chinese
☆11Jul 2, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ShareXin / screenshot-rs
View on GitHub
Simple library that allows for simple method of asking for screenshots from various Linux/BSD desktops
☆10Jun 13, 2019Updated 7 years ago
ludlows / pesq-mex
View on GitHub
the MEX wrapper for PESQ (Perceptual Evaluation of Speech Quality)
☆15May 10, 2019Updated 7 years ago
aispeech-lab / TinyWASE
View on GitHub
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Jun 28, 2021Updated 5 years ago
zceng / LVCNet
View on GitHub
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
☆80Feb 24, 2021Updated 5 years ago
fgnt / ci_sdr
View on GitHub
☆53May 15, 2025Updated last year
Yoctol / text-normalizer
View on GitHub
Normalize text string
☆12Nov 6, 2018Updated 7 years ago
matousc89 / signalz
View on GitHub
Data generators in Python
☆14Jun 10, 2019Updated 7 years ago