py-webrtcvad wrapper for trimming speech clips
☆48Jul 3, 2022Updated 3 years ago
Alternatives and similar repositories for python-vad
Users that are interested in python-vad are comparing it to the libraries listed below
Sorting:
- Filter Banks, Fast Python Implementation☆42Jul 9, 2022Updated 3 years ago
- Various algorithms for voice activity detection☆22Jan 31, 2017Updated 9 years ago
- CHiME-5 Baseline Array Synchronisation☆12Sep 24, 2018Updated 7 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Four neural network architectures to classify sound source direction☆11Oct 3, 2020Updated 5 years ago
- ☆26Aug 8, 2024Updated last year
- ☆10Nov 1, 2025Updated 3 months ago
- High-level API for tar-based dataset☆12Feb 3, 2024Updated 2 years ago
- ☆14Jun 10, 2020Updated 5 years ago
- Voice Activity Detector☆74Jan 22, 2026Updated last month
- Tensorflow Optimizers☆11Sep 1, 2019Updated 6 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- Support tools for punctuation and boundary detection for ASR output.☆55Dec 8, 2022Updated 3 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆33Apr 24, 2020Updated 5 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆361Dec 24, 2021Updated 4 years ago
- In order to demonstrate any signal accurately it is important to know the noise containt in the signal. Thus, a fundamental measure is th…☆13May 10, 2021Updated 4 years ago
- Normalize text string☆12Nov 6, 2018Updated 7 years ago
- LogMMSE speech enhancement/noise reduction☆90Apr 1, 2020Updated 5 years ago
- tensorflow speech synthesis c++ inference for voicenet☆16Mar 29, 2019Updated 6 years ago
- Blazing fast data loading with HuggingFace Dataset and Ray Data☆16Jan 12, 2024Updated 2 years ago
- Open source cross-platform implementation of MRCP protocol☆20Mar 3, 2022Updated 3 years ago
- Voice Activity Detector in Python☆480Nov 17, 2020Updated 5 years ago
- A package used to test webrtc apm functions, such as aec, ns☆17Feb 21, 2019Updated 7 years ago
- A CNN for denoising speech.☆17Jun 2, 2019Updated 6 years ago
- Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.☆19Oct 28, 2025Updated 4 months ago
- Proof of concept for running moshi/hibiki using webrtc☆20Feb 28, 2025Updated last year
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- An STFT/iSTFT for PyTorch.☆369Oct 31, 2023Updated 2 years ago
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Sep 30, 2024Updated last year
- Tools for speech processing, keyword spotting☆17Mar 11, 2020Updated 5 years ago
- Voice Activity Detection System☆21Jun 9, 2015Updated 10 years ago
- List of NN based singal processing papers☆22Jun 5, 2023Updated 2 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- Python library for audio augmentation☆85Jul 6, 2023Updated 2 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- Small-footprint Keyword Spotting☆18Jul 28, 2019Updated 6 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification".☆42Mar 9, 2023Updated 2 years ago
- Unofficial implementation of wavenext vocoder☆59Aug 28, 2024Updated last year