taekb / gcloud_speech_voice_recorderLinks
Flask-based web application that records sound (as PCM/WAV) and converts speech to text via Google Cloud Speech API using HTML, JavaScript, and Python
☆43Updated 8 years ago
Alternatives and similar repositories for gcloud_speech_voice_recorder
Users that are interested in gcloud_speech_voice_recorder are comparing it to the libraries listed below
Sorting:
- Identifying people from small audio fragments☆170Updated 5 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 6 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 6 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆115Updated 6 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…☆70Updated 8 years ago
- Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks☆64Updated 6 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 7 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 6 years ago
- Automatic Speech Recognition Dataset Generation☆37Updated 7 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Updated 4 years ago
- ☆38Updated 5 years ago
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descript…☆28Updated 6 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆62Updated 2 years ago
- Speech Recognition Scoring Toolkit☆13Updated 10 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- Collection of research papers on cough classification☆40Updated 5 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated 2 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆90Updated last year
- Freesound Audio Tagging 2019☆95Updated 6 years ago
- mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras☆71Updated 8 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Updated 8 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 5 years ago
- ☆45Updated 6 years ago
- Automatic Dialect Detection Repository☆39Updated 3 years ago
- An implementation of RNN-Transducer loss in TF-2.0.☆46Updated this week
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated last year
- A deep learning model is developed which can predict the native country on the basis of the spoken english accent☆51Updated 5 years ago