JinScientist / voice-gender-recognitionLinks
Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender
☆13Updated 7 years ago
Alternatives and similar repositories for voice-gender-recognition
Users that are interested in voice-gender-recognition are comparing it to the libraries listed below
Sorting:
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆37Updated 7 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆52Updated 3 years ago
- Speaker Diarization library in Python. Performs VAD, Segmentation, Linear Clustering, Hierarchical Clustering☆15Updated 8 years ago
- A repository for Chinese text normalization.☆20Updated 4 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 7 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 5 years ago
- it's ASR decoder and make graph project☆33Updated 3 years ago
- This Repository includes four different implementations of the Speaker Verification task including the GMM_UBM, Ivector, Deep-Speaker, an…☆32Updated 7 years ago
- magicspeech competition recipe☆18Updated 5 years ago
- A random forest classifier to predict the age-group and gender of a speaker from voice measurements.☆18Updated 6 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 6 years ago
- wake word spotting with kaldi☆19Updated 4 years ago
- ☆38Updated 5 years ago
- implement end-to-end asr algorithm with tensorflow☆40Updated 7 years ago
- A SPMI Lab toolkit for language models.☆11Updated 8 years ago
- Machine learning experiment to perform gender classification from raw audio.☆23Updated 7 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- A CNN audio classifier via spectrogram images.☆10Updated 8 years ago
- TTS model based on Transformer.☆58Updated 6 years ago
- Online streaming speaker change detection model in Pytorch☆42Updated 2 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 6 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 6 years ago
- Experiment with JNI access to some Kaldi functions.☆12Updated 6 years ago
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- ☆16Updated 6 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- Speaker diarization with GMM-UBM and MAP Adaptation☆30Updated 7 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 7 years ago
- ☆61Updated 2 years ago