dawntcherian / Google-speech-to-text-python-websocket-server-using-microphone-stream
Python WebSocket server that converts an input audio stream from a microphone to text using Google Speech-to-Text
☆44 Updated 2 years ago
Alternatives and similar repositories for Google-speech-to-text-python-websocket-server-using-microphone-stream:
Users who are interested in Google-speech-to-text-python-websocket-server-using-microphone-stream are comparing it to the libraries listed below.
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆102 Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72 Updated 2 years ago
- A lightweight library to compute Diarization Error Rate (DER). ☆59 Updated last year
- Websockets <-> Riva proxy service. Audiocodes compatible. ☆14 Updated last year
- Speaker diarization Python system based on binary key speaker modelling ☆61 Updated 3 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆64 Updated 4 years ago
- This repository contains text-to-speech (TTS) models and utilities designed to produce synthetic training datasets for other speech-related … ☆18 Updated last year
- Speaker diarization scripts, based on AaltoASR ☆190 Updated 6 years ago
- Machine learning experiment to perform gender classification from raw audio. ☆23 Updated 6 years ago
- Classify daily life events using audio data. ☆51 Updated 5 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice ☆10 Updated 4 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy). ☆81 Updated 8 months ago
- Using speaker embedding for diarization in PyTorch ☆18 Updated 4 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆96 Updated last year
- Removes silence segments from wav audio files ☆29 Updated 4 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-texttospeech ☆126 Updated last year
- Extract frequency, power, width and dissonance of formants from wav files ☆25 Updated 2 years ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech ☆91 Updated last year
- This repository is a collection of TTS Models in TFLite ☆189 Updated 4 years ago
- Zero-shot Audio Classification using Whisper ☆78 Updated 2 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level. ☆25 Updated 2 years ago
- Experimental project to punctuate text using an embedding layer, single convolutional layer and output softmax layer written in Keras. ☆83 Updated 4 years ago
- 🐸TTS recipes for different datasets ☆85 Updated 2 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆164 Updated 7 months ago
- DeepSpeech based forced alignment tool ☆237 Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆46 Updated 7 months ago
- Removing background noise in a sound file ☆63 Updated 5 years ago