samemon / Voice-to-Age
This program determines the age range of a person from their voice. It uses a simple Mel-log spectrogram approach with a multi-layer perceptron model with relu as an activation and softmax in the final layer.
☆15Updated 7 years ago
Alternatives and similar repositories for Voice-to-Age:
Users that are interested in Voice-to-Age are comparing it to the libraries listed below
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- A Text2Speech Engine built in Pytorch.☆12Updated 6 years ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- VOiCES-subset☆8Updated 6 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow☆21Updated 6 years ago
- Experiments and tutorials with and for torchaudio☆13Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 4 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 6 years ago
- The Hidden Markov Model Toolkit (HTK)☆12Updated 7 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 5 years ago
- This repository contains notebooks showing how to perform mixed precision training in tf.keras 2.0☆12Updated 5 years ago
- bumble bee transformer☆14Updated 3 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated last year
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- WaveNet Vocoder Samples☆23Updated 5 years ago
- Unsupervised word segmentation and clustering of speech☆13Updated 8 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Updated 4 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 4 years ago
- A monster repo for random research, not organized in any particular way☆13Updated 8 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Updated 10 years ago
- readers that enable reading kaldi ark in tensorflow☆17Updated 7 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation☆21Updated 5 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Updated 4 years ago
- ABX and kaldi experiments on speech corpora made easy☆31Updated 4 months ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆47Updated 8 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆24Updated 4 years ago
- Using embedding-based loss functions for phonetics/speech recognition.☆17Updated 10 years ago
- ☆16Updated 5 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago