samemon / Voice-to-AgeLinks
This program determines the age range of a person from their voice. It uses a simple Mel-log spectrogram approach with a multi-layer perceptron model with relu as an activation and softmax in the final layer.
☆15Updated 7 years ago
Alternatives and similar repositories for Voice-to-Age
Users that are interested in Voice-to-Age are comparing it to the libraries listed below
Sorting:
- Representations of language in a model of visually grounded speech signal.☆23Updated 7 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 5 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Updated 4 years ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- The History of Speech Recognition to the Year 2030☆13Updated 3 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 7 years ago
- Experiments and tutorials with and for torchaudio☆13Updated 4 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17Updated 10 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 4 years ago
- End-to-end deep learned Automatic Speech Recognition system☆8Updated 8 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆48Updated 8 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆14Updated 9 years ago
- (Si)mply a (Re)search front-end for Text-To-Speech Synthesis.☆10Updated 7 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 4 years ago
- Unsupervised word segmentation and clustering of speech