dawntcherian / Google-speech-to-text-python-websocket-server-using-microphone-stream
Python WebSocket server which converts input audio stream from microphone to text using Google speech to text
☆44Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Google-speech-to-text-python-websocket-server-using-microphone-stream
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 2 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆79Updated 4 months ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆62Updated 4 years ago
- Machine learning experiment to perform gender classification from raw audio.☆23Updated 6 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 2 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.☆236Updated last year
- DeepSpeech based forced alignment tool☆233Updated 3 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆200Updated 3 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆67Updated last year
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆109Updated last year
- Support tools for punctuation and boundary detection for ASR output.☆57Updated last year
- speaker diarization system using an LSTM☆49Updated last year
- Server framework for Kaldi ASR Toolkit☆96Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆140Updated 6 months ago
- Spoken Language assessment☆41Updated 3 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆99Updated last year
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆202Updated last year
- Using speaker embedding for diarization in PyTorch☆18Updated 4 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆176Updated 2 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆13Updated 4 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆101Updated 4 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated last year
- Wrapper for pydub AudioSegment objects☆95Updated last year
- Automatic Speaker Recognition algorithms in Python☆93Updated 3 years ago
- Various speech datasets made available to the public☆98Updated last month
- STT Service based on Kaldi ASR☆15Updated 6 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Basic python tornado app for handling websocket audio☆10Updated last year