daanzu / py-webrtcvad-wheelsLinks

Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]

☆21

Alternatives and similar repositories for py-webrtcvad-wheels

Users that are interested in py-webrtcvad-wheels are comparing it to the libraries listed below

Sorting:

SEPIA-Framework / sepia-stt-server
SEPIA server to support open-source speech recognition via WebSocket connection.
☆128Updated 8 months ago
loretoparisi / hf-experiments
Experiments with Hugging Face 🔬 🤗
☆44Updated 10 months ago
mozilla / deepspeech-playbook
A crash course for training speech recognition models using DeepSpeech.
☆25Updated 4 years ago
coqui-ai / stt-model-manager
Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
☆26Updated 2 years ago
thorstenMueller / cTTS
TTS Client for Coqui TTS server
☆13Updated 2 years ago
jumon / zac
Zero-shot Audio Classification using Whisper
☆79Updated 2 years ago
daanzu / deepspeech-websocket-server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
☆102Updated 5 years ago
wolfmanstout / screen-ocr
Easily perform OCR on portions of the screen, choosing from a selection of backends.
☆47Updated 2 weeks ago
fquirin / speech-recognition-experiments
Experiments to test different speech recognition systems for SEPIA Framework
☆60Updated 2 years ago
Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆220Updated last week
gooofy / py-nltools
A collection of basic python modules for spoken natural language processing
☆56Updated 5 years ago
amrrs / openai-whisper-webapp
Code for OpenAI Whisper Web App Demo
☆93Updated 2 years ago
mjansche / openfst
Read-only unofficial mirror of OpenFst
☆44Updated 3 years ago
coqui-ai / snakepit
🐍 Coqui's machine learning job scheduler
☆32Updated 3 years ago
coqui-ai / STT-examples
🐸STT integration examples
☆129Updated 2 years ago
ReadAlongs / SoundSwallower
An even smaller speech recognizer / force aligner
☆35Updated 7 months ago
coqui-ai / TTS-recipes
🐸TTS recipes for different datasets
☆86Updated 2 years ago
chrisspen / punctuator2
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
☆36Updated 4 years ago
oliverguhr / fullstop-deep-punctuation-prediction
A model that predicts the punctuation of English, Italian, French and German texts.
☆80Updated 2 years ago
rhasspy / gruut
A tokenizer, text cleaner, and phonemizer for many human languages.
☆320Updated 8 months ago
rhasspy / rhasspy-silence
Silence detection in audio stream using webrtcvad
☆48Updated last year
prateekralhan / OpenAI_Whisper_ASR
A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models
☆66Updated 2 years ago
Picovoice / octopus
On-device Speech-to-Index engine powered by deep learning
☆36Updated 3 months ago
EtienneAb3d / WhisperTimeSync
Synchronize Whisper's timestamps over an existing accurate transcription
☆153Updated last year
coqui-ai / data-checker
🫠 check your data, before you wreck your model
☆16Updated 2 years ago
coqui-ai / Trainer
🐸 - A general purpose model trainer, as flexible as it gets
☆220Updated last year
CristiFati / pyaudio
A fork of https://people.csail.mit.edu/hubert/git/pyaudio.git. Last synchronized on 20231119.
☆42Updated last year
voicesauce / opensauce-python
Voice analysis software (Python port of VoiceSauce)
☆59Updated 6 years ago
SYSTRAN / fuzzy-match
Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
☆50Updated 2 months ago
benob / recasepunc
Model for recasing and repunctuating ASR transcripts
☆136Updated last year