daanzu / py-webrtcvad-wheelsLinks
Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]
☆20Updated 7 months ago
Alternatives and similar repositories for py-webrtcvad-wheels
Users that are interested in py-webrtcvad-wheels are comparing it to the libraries listed below
Sorting:
- Faster Whisper ASR transcription with CTranslate2☆22Updated 8 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆26Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆102Updated 4 months ago
- Experiments with Hugging Face 🔬 🤗☆44Updated 10 months ago
- streaming speech to text server using Whisper☆93Updated 2 years ago
- Tunable pipelines☆34Updated 4 months ago
- Self-contained Python package for OpenFst☆51Updated 2 years ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆52Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆63Updated last week
- A fork of https://people.csail.mit.edu/hubert/git/pyaudio.git. Last synchronized on 20231119.☆41Updated 11 months ago
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Ea…☆14Updated 4 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated 2 years ago
- OpenAI Whisper Prompt Examples☆52Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 4 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- Seed Machine Translation Data☆32Updated 7 months ago
- ☆14Updated 2 years ago
- Extract knowledge from raw text☆13Updated 3 years ago
- ☆104Updated last month
- A collection of basic python modules for spoken natural language processing☆56Updated 5 years ago
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.☆17Updated last year
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago