daanzu / py-webrtcvad-wheels
Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]
☆20 · Updated 5 months ago
Alternatives and similar repositories for py-webrtcvad-wheels:
Users who are interested in py-webrtcvad-wheels compare it to the libraries listed below.
- Faster Whisper ASR transcription with CTranslate2 ☆20 · Updated 6 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆100 · Updated 2 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 · Updated 2 years ago
- GUI Tool to create, manage and test Keyword Spotting models using TF 2.0 ☆12 · Updated 4 years ago
- Zero-shot Audio Classification using Whisper ☆80 · Updated 2 years ago
- On-device speaker recognition engine powered by deep learning ☆34 · Updated last week
- OCTRA is a web-application for the orthographic transcription of audio files. ☆39 · Updated this week
- A crash course for training speech recognition models using DeepSpeech. ☆25 · Updated 3 years ago
- Experiments with Hugging Face 🔬 🤗 ☆44 · Updated 8 months ago
- streaming speech to text server using Whisper ☆91 · Updated last year
- TTS Client for Coqui TTS server ☆13 · Updated 2 years ago
- 🫠 check your data, before you wreck your model ☆16 · Updated 2 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level. ☆25 · Updated 2 years ago
- A python library to find differences between audio and transcriptions ☆19 · Updated last year
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ☆24 · Updated 3 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆36 · Updated 4 years ago
- Self-contained Python package for OpenFst ☆51 · Updated 2 years ago
- Bilingual sentence similarity classifier using Tensorflow ☆21 · Updated 5 years ago
- A collection of basic python modules for spoken natural language processing ☆56 · Updated 5 years ago
- OpenAI Whisper Prompt Examples ☆52 · Updated last year
- Tunable pipelines ☆33 · Updated 2 months ago
- A curated list of awesome voice activity detection ☆50 · Updated 5 months ago
- 🐍 Coqui's machine learning job scheduler ☆32 · Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ☆26 · Updated 2 years ago
- Keras(Tensorflow) implementations of Automatic Speech Recognition ☆23 · Updated 3 years ago
- Dataset Release for Intent Classification from Speech ☆46 · Updated 2 months ago
- ☆23 · Updated 2 years ago
- ☆39 · Updated last year
- On-device speaker diarization powered by deep learning ☆44 · Updated last month
- Diff filtering, text mapping, and windowed transforms for LLM apps ☆14 · Updated 2 weeks ago