daanzu / py-webrtcvad-wheelsLinks
Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]
β21Updated 7 months ago
Alternatives and similar repositories for py-webrtcvad-wheels
Users that are interested in py-webrtcvad-wheels are comparing it to the libraries listed below
Sorting:
- SEPIA server to support open-source speech recognition via WebSocket connection.β128Updated 8 months ago
- Experiments with Hugging Face π¬ π€β44Updated 10 months ago
- A crash course for training speech recognition models using DeepSpeech.β25Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ26Updated 2 years ago
- TTS Client for Coqui TTS serverβ13Updated 2 years ago
- Zero-shot Audio Classification using Whisperβ79Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 5 years ago
- Easily perform OCR on portions of the screen, choosing from a selection of backends.β47Updated 2 weeks ago
- Experiments to test different speech recognition systems for SEPIA Frameworkβ60Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learningβ220Updated last week
- A collection of basic python modules for spoken natural language processingβ56Updated 5 years ago
- Code for OpenAI Whisper Web App Demoβ93Updated 2 years ago
- Read-only unofficial mirror of OpenFstβ44Updated 3 years ago
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- πΈSTT integration examplesβ129Updated 2 years ago
- An even smaller speech recognizer / force alignerβ35Updated 7 months ago
- πΈTTS recipes for different datasetsβ86Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented textβ36Updated 4 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.β80Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β320Updated 8 months ago
- Silence detection in audio stream using webrtcvadβ48Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" modelsβ66Updated 2 years ago
- On-device Speech-to-Index engine powered by deep learningβ36Updated 3 months ago
- Synchronize Whisper's timestamps over an existing accurate transcriptionβ153Updated last year
- π« check your data, before you wreck your modelβ16Updated 2 years ago
- πΈ - A general purpose model trainer, as flexible as it getsβ220Updated last year
- A fork of https://people.csail.mit.edu/hubert/git/pyaudio.git. Last synchronized on 20231119.β42Updated last year
- Voice analysis software (Python port of VoiceSauce)β59Updated 6 years ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.β50Updated 2 months ago
- Model for recasing and repunctuating ASR transcriptsβ136Updated last year