taekb / gcloud_speech_voice_recorderLinks
Flask-based web application that records sound (as PCM/WAV) and converts speech to text via Google Cloud Speech API using HTML, JavaScript, and Python
☆43Updated 7 years ago
Alternatives and similar repositories for gcloud_speech_voice_recorder
Users that are interested in gcloud_speech_voice_recorder are comparing it to the libraries listed below
Sorting:
- speaker_diarization done on toy dataset and tested on timit dataset☆7Updated 3 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- ☆38Updated 5 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated 9 months ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Updated 6 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
- End to End Dialect Identification using Convolutional Neural Network☆52Updated 5 years ago
- Machine learning experiment to perform gender classification from raw audio.☆23Updated 6 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆51Updated 2 years ago
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).☆38Updated 5 years ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆44Updated 4 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆48Updated 8 years ago
- Audio classification via transfer learning☆33Updated 5 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 5 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…☆69Updated 7 years ago
- Feature extractor for DL speech processing.☆65Updated 3 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Updated last year
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 5 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 4 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago
- Keras + pyTorch implimentation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"☆29Updated 6 years ago
- ☆60Updated 4 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Updated 4 years ago
- Control mechanisms to the U-Net architecture for doing multiple source separation instruments☆52Updated 5 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆34Updated 7 years ago
- Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix ) features of a spectrogram…☆24Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago