samemon / Voice-to-Age
This program determines the age range of a person from their voice. It uses a simple Mel-log spectrogram approach with a multi-layer perceptron model with relu as an activation and softmax in the final layer.
☆15Updated 7 years ago
Alternatives and similar repositories for Voice-to-Age:
Users that are interested in Voice-to-Age are comparing it to the libraries listed below
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- Unsupervised word segmentation and clustering of speech☆13Updated 8 years ago
- (Si)mply a (Re)search front-end for Text-To-Speech Synthesis.☆10Updated 6 years ago
- ABX and kaldi experiments on speech corpora made easy☆32Updated 7 months ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 9 months ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 5 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆48Updated 8 years ago
- ☆16Updated 5 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 4 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆13Updated 9 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17Updated 9 years ago
- End-to-end deep learned Automatic Speech Recognition system☆8Updated 8 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 5 years ago
- WaveNet Vocoder Samples☆23Updated 5 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"☆11Updated 6 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Updated 4 years ago
- ☆15Updated 2 years ago
- LogMMSE speech enhancement/noise reduction☆30Updated 4 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Updated 4 years ago
- ☆15Updated 7 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- Audio Keyword Search☆12Updated 6 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆29Updated 10 months ago