dawntcherian / Google-speech-to-text-python-websocket-server-using-microphone-stream
Python WebSocket server that converts a microphone audio stream to text using Google Speech-to-Text
☆44Updated last year
Related projects
Alternatives and complementary repositories for Google-speech-to-text-python-websocket-server-using-microphone-stream
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 2 years ago
- Python speaker diarization system based on binary key speaker modelling☆61Updated 2 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆79Updated 5 months ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆160Updated 5 months ago
- A Voice Biometric Application using Watson Speech to Text☆81Updated last year
- A list of publicly available audio data that anyone can download for ASR or other speech activities☆200Updated 3 years ago
- This repository is a collection of TTS Models in TFLite☆189Updated 3 years ago
- DeepSpeech based forced alignment tool☆235Updated 3 years ago
- ☆34Updated 10 months ago
- Paper: https://arxiv.org/abs/1702.02285☆62Updated 5 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆13Updated 4 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆101Updated 4 years ago
- Classify daily life events using audio data.☆48Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆110Updated 2 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆62Updated 4 years ago
- This project compares two audio files based on their MFCCs.☆32Updated 4 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated last year
- ☆84Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆67Updated last year
- This program calculates the word error rate of an ASR hypothesis and prints the aligned result.☆152Updated 4 years ago
- Advanced data structures for handling temporal segments with attached labels.☆99Updated 5 months ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆183Updated 2 years ago
- An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most s…☆17Updated 5 years ago
- Live speech recognition using Facebook's wav2vec 2.0 model.☆328Updated 9 months ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- Analyzes a signal and finds its fundamental frequency, HNR, etc.☆14Updated 7 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆141Updated 6 months ago
- ☆41Updated last year
- Predicting emotions based on speech audio samples of American English, German and British English languages using Support Vector Machine,…☆18Updated 6 years ago