dawntcherian / Google-speech-to-text-python-websocket-server-using-microphone-stream
Python WebSocket server that converts a microphone audio stream to text using Google Speech-to-Text
☆45 Updated 2 years ago
Alternatives and similar repositories for Google-speech-to-text-python-websocket-server-using-microphone-stream:
Users who are interested in Google-speech-to-text-python-websocket-server-using-microphone-stream compare it to the libraries listed below.
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆102 Updated 4 years ago
- Identifying people from small audio fragments ☆170 Updated 4 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆64 Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTC ☆69 Updated last year
- An HTML interface for finetuning the sync map output from aeneas ☆53 Updated 2 years ago
- Python speaker diarization system based on binary key speaker modelling ☆61 Updated 3 years ago
- Accent Classification in Speech ☆25 Updated 5 years ago
- Speaker diarization system using an LSTM ☆50 Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 4 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆146 Updated 10 months ago
- A lightweight library to compute Diarization Error Rate (DER). ☆59 Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆165 Updated 9 months ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72 Updated 2 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆96 Updated last year
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-texttospeech ☆126 Updated last year
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ☆111 Updated 2 years ago
- Automatic Speaker Recognition algorithms in Python ☆95 Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databases ☆98 Updated last month
- ☆84 Updated 4 years ago
- ☆65 Updated 2 years ago
- A collection of basic Python modules for spoken natural language processing ☆56 Updated 5 years ago
- Advanced data structures for handling temporal segments with attached labels. ☆111 Updated last month
- A best practice for streaming audio from a browser microphone to Dialogflow or Google Cloud STT by using websockets. ☆143 Updated last month
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines … ☆60 Updated 3 years ago
- Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible. ☆170 Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆65 Updated 3 years ago
- GStreamer plugin around Kaldi's online neural network decoder ☆185 Updated 4 years ago
- A collection of useful tools for handling speech recognition data ☆30 Updated 2 years ago
- Converts spoken words into text form. ☆76 Updated last year
- Various speech datasets made available to the public ☆115 Updated 3 months ago