dawntcherian / Google-speech-to-text-python-websocket-server-using-microphone-stream
Python WebSocket server which converts input audio stream from microphone to text using Google speech to text
☆47Updated 2 years ago
Alternatives and similar repositories for Google-speech-to-text-python-websocket-server-using-microphone-stream:
Users that are interested in Google-speech-to-text-python-websocket-server-using-microphone-stream are comparing it to the libraries listed below
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆83Updated 10 months ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- Wrapper for pydub AudioSegment objects☆96Updated 2 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 4 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago
- This repository is a collection of TTS Models in TFLite☆192Updated 4 years ago
- Speaker identification using voice MFCCs and GMM☆54Updated 4 years ago
- A Voice Biometric Application using Watson Speech to Text☆85Updated 2 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆177Updated 3 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆25Updated 2 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- Support tools for punctuation and boundary detection for ASR output.☆57Updated 2 years ago
- Predicting emotions based on speech audio samples of American English, German and British English languages using Support Vector Machine,…☆19Updated 6 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆242Updated 5 years ago
- A collection of useful tools for handling speech recognition data☆30Updated 2 years ago
- Python bindings around the LAME encoder☆58Updated 3 months ago
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated last year
- Using speaker embedding for diarization in PyTorch☆18Updated 4 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- A deep learning model is developed which can predict the native country on the basis of the spoken english accent☆47Updated 5 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆148Updated 11 months ago
- A repository for dictionaries to be used with the Prosodylab-Aligner☆17Updated 10 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 6 years ago
- 📝An easy-to-use package to restore punctuation of the text.☆115Updated 2 years ago