samemon / Voice-to-Age
This program determines the age range of a person from their voice. It uses a simple Mel-log spectrogram approach with a multi-layer perceptron model with relu as an activation and softmax in the final layer.
☆15Updated 7 years ago
Alternatives and similar repositories for Voice-to-Age:
Users that are interested in Voice-to-Age are comparing it to the libraries listed below
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- Using embedding-based loss functions for phonetics/speech recognition.☆17Updated 10 years ago
- A monster repo for random research, not organized in any particular way☆13Updated 8 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 6 years ago
- Experiments and tutorials with and for torchaudio☆13Updated 3 years ago
- ☆21Updated 6 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 4 years ago
- ☆15Updated 6 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow☆21Updated 6 years ago
- codes for TokenManipulationGAN☆7Updated 4 years ago
- ☆16Updated 5 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Updated 4 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- Unsupervised word segmentation and clustering of speech☆13Updated 7 years ago
- ☆8Updated 6 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆13Updated 9 years ago
- ☆20Updated 5 years ago
- GPT-jax based on the official huggingface library☆13Updated 3 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆29Updated last year
- Breaks a word into syllables using an LSTM-based neural network.☆19Updated last year
- ☆27Updated 5 years ago
- ABX and kaldi experiments on speech corpora made easy☆31Updated 3 months ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 5 months ago
- some tutorials for blog: simonjisu.github.io☆23Updated 3 years ago