daanzu / py-silero-vad-lite
Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies
☆15 · Nov 25, 2024 · Updated last year
Alternatives and similar repositories for py-silero-vad-lite
Users that are interested in py-silero-vad-lite are comparing it to the libraries listed below
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma… ☆21 · Jan 24, 2022 · Updated 4 years ago
- High-level API for creating dragonfly grammars ☆14 · Oct 11, 2021 · Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly. ☆16 · Jun 17, 2022 · Updated 3 years ago
- ☆11 · Aug 11, 2023 · Updated 2 years ago
- ☆11 · May 7, 2022 · Updated 3 years ago
- Speechflow for emotion recognition related information decomposition ☆10 · Jul 27, 2021 · Updated 4 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆16 · Dec 3, 2024 · Updated last year
- A list of similar sounding words to help disambiguate voice coding ☆11 · May 20, 2020 · Updated 5 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024) ☆12 · Sep 2, 2024 · Updated last year
- ☆13 · Oct 27, 2021 · Updated 4 years ago
- A simple implementation for improving CosyVoice2 by GRPO method ☆32 · Oct 17, 2025 · Updated 3 months ago
- Mic-controlled mouse clicks ☆17 · Oct 6, 2025 · Updated 4 months ago
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025] ☆21 · Jun 9, 2025 · Updated 8 months ago
- Control your computer by voice! ☆13 · Dec 8, 2022 · Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆18 · Oct 2, 2024 · Updated last year
- ☆11 · Oct 14, 2023 · Updated 2 years ago
- ☆14 · Aug 19, 2024 · Updated last year
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech ☆16 · Sep 20, 2024 · Updated last year
- Chinese speech recognition, automatic speech recognition (ASR) ☆14 · Dec 30, 2021 · Updated 4 years ago
- Toolkit for training/adapting CMU Sphinx acoustic models ☆17 · May 25, 2018 · Updated 7 years ago
- ☆16 · Dec 23, 2021 · Updated 4 years ago
- Voice activity engine benchmark framework ☆21 · Jan 14, 2026 · Updated last month
- A selective noise filter architecture driven by a CNN and Wiener filter ☆18 · Nov 21, 2019 · Updated 6 years ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor ☆18 · Jun 5, 2023 · Updated 2 years ago
- ☆15 · Nov 5, 2021 · Updated 4 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆18 · Nov 30, 2022 · Updated 3 years ago
- ☆20 · Jul 22, 2022 · Updated 3 years ago
- Python wrapper for kaldi's arpa2fst ☆38 · Aug 27, 2025 · Updated 5 months ago
- ☆32 · Dec 24, 2025 · Updated last month
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es… ☆19 · Jun 14, 2021 · Updated 4 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆33 · Sep 9, 2025 · Updated 5 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization ☆45 · May 13, 2025 · Updated 9 months ago
- Chinese and English Bilinguish G2P ☆22 · Jul 16, 2023 · Updated 2 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification" ☆18 · Jun 24, 2022 · Updated 3 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks ☆20 · Feb 6, 2021 · Updated 5 years ago
- singing voice conversion without f0 ☆23 · May 10, 2023 · Updated 2 years ago
- Baseline convolutional ASR system in PyTorch ☆21 · Nov 16, 2023 · Updated 2 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court ☆22 · Dec 8, 2022 · Updated 3 years ago
- ☆23 · Oct 17, 2024 · Updated last year