samemon / Voice-to-Age
This program determines the age range of a person from their voice. It uses a simple Mel-log spectrogram approach with a multi-layer perceptron model with relu as an activation and softmax in the final layer.
☆15Updated 7 years ago
Alternatives and similar repositories for Voice-to-Age:
Users that are interested in Voice-to-Age are comparing it to the libraries listed below
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 5 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 4 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆13Updated 9 years ago
- readers that enable reading kaldi ark in tensorflow☆17Updated 7 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 6 years ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 4 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Updated 5 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆47Updated 8 years ago
- Code for AccentDB.☆20Updated 3 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 7 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 4 years ago
- ABX and kaldi experiments on speech corpora made easy☆32Updated 5 months ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Unsupervised word segmentation and clustering of speech☆13Updated 8 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated last year
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- implement end-to-end asr algorithm with tensorflow☆40Updated 6 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Examples of cleaning up raw voices☆18Updated 3 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- Convolutional neural networks for sound classification☆20Updated 7 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Updated 4 years ago
- A monster repo for random research, not organized in any particular way☆13Updated 8 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆29Updated 9 months ago
- Automatic Speech Recognition Dataset Generation☆37Updated 6 years ago