dawntcherian / Google-speech-to-text-python-websocket-server-using-microphone-stream
Python WebSocket server which converts input audio stream from microphone to text using Google speech to text
☆47Updated 2 years ago
Alternatives and similar repositories for Google-speech-to-text-python-websocket-server-using-microphone-stream
Users that are interested in Google-speech-to-text-python-websocket-server-using-microphone-stream are comparing it to the libraries listed below
Sorting:
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆84Updated 11 months ago
- Wrapper for pydub AudioSegment objects☆96Updated 2 years ago
- 📝An easy-to-use package to restore punctuation of the text.☆115Updated 2 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆230Updated 2 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 4 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Identifying people from small audio fragments☆170Updated 5 years ago
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0☆245Updated 4 years ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-texttospeech☆126Updated last year
- Text to Speech with PyTorch (English and Mongolian)☆185Updated 7 months ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated 2 years ago
- This program calculates the word error rate of hypothesis in ASR and print the aligned result.☆155Updated 5 years ago
- ESPnet Model Zoo☆250Updated last year
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆16Updated 5 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆60Updated 3 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆194Updated 2 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆90Updated 3 years ago
- Support tools for punctuation and boundary detection for ASR output.☆57Updated 2 years ago
- Simple audio recorder that sends WAV from browser to server in Python (Flask).☆31Updated 2 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 2 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated last year
- A list of publically available audio data that anyone can download for ASR or other speech activities☆209Updated 3 years ago
- ☆42Updated 3 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- ☆67Updated 5 months ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆168Updated 10 months ago