daanzu / py-webrtcvad-wheels
Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]
☆17Updated 4 months ago
Alternatives and similar repositories for py-webrtcvad-wheels:
Users that are interested in py-webrtcvad-wheels are comparing it to the libraries listed below
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆98Updated last month
- Zero-shot Audio Classification using Whisper☆80Updated 2 years ago
- Read-only unofficial mirror of OpenFst☆44Updated 2 years ago
- Self-contained Python package for OpenFst☆51Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- Faster Whisper ASR transcription with CTranslate2☆20Updated 5 months ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 4 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 5 months ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 2 years ago
- Tunable pipelines☆32Updated last month
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Ea…☆14Updated 3 years ago
- A curated list of awesome voice activity detection☆45Updated 4 months ago
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated 2 months ago
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated last year
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Updated 10 years ago
- Advanced data structures for handling temporal segments with attached labels.☆111Updated last month
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆48Updated 10 months ago
- Experiments with Hugging Face 🔬 🤗☆44Updated 7 months ago
- Awesome TTS☆56Updated 3 years ago
- Package for inference for punctuation, true-casing, and sentence boundary detection☆25Updated 9 months ago
- GUI Tool to create, manage and test Keyword Spotting models using TF 2.0☆12Updated 4 years ago
- MozoLM: A language model (LM) serving library☆44Updated last month
- Labeled data for homograph disambiguation☆57Updated last year
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆23Updated 6 months ago