samemon / Voice-to-Age
This program estimates a person's age range from their voice. It computes a log-Mel spectrogram of the audio and classifies it with a multi-layer perceptron that uses ReLU activations and a softmax final layer.
☆15 Updated 7 years ago
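The approach in the description can be sketched roughly as follows. This is a minimal NumPy illustration, not the repository's actual code: the sample rate, FFT size, hop length, number of Mel bands, layer sizes, the mean-over-time pooling, and the choice of four age ranges are all assumptions made for the example, and the weights are random rather than trained.

```python
import numpy as np

def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    """Triangular filters spaced evenly on the Mel scale."""
    hz_points = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(n_mels):
        left, center, right = hz_points[m], hz_points[m + 1], hz_points[m + 2]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        fb[m] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel_spectrogram(signal, sr=16000, n_fft=512, hop=256, n_mels=40):
    """Return an array of shape (n_frames, n_mels). All defaults are assumed values."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack([signal[i * hop:i * hop + n_fft] * window for i in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    mel = power @ mel_filterbank(sr, n_fft, n_mels).T
    return np.log(mel + 1e-10)  # small offset avoids log(0)

def relu(x):
    return np.maximum(0.0, x)

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def init_mlp(sizes, seed=0):
    """Random, untrained parameters; a real model would learn these from labeled audio."""
    rng = np.random.default_rng(seed)
    weights = [rng.normal(0.0, 0.1, (a, b)) for a, b in zip(sizes[:-1], sizes[1:])]
    biases = [np.zeros(b) for b in sizes[1:]]
    return weights, biases

def mlp_forward(features, weights, biases):
    """ReLU on the hidden layers, softmax on the final layer."""
    h = features
    for w, b in zip(weights[:-1], biases[:-1]):
        h = relu(h @ w + b)
    return softmax(h @ weights[-1] + biases[-1])

# One second of a synthetic 220 Hz tone stands in for a voice recording.
sr = 16000
t = np.arange(sr) / sr
signal = np.sin(2.0 * np.pi * 220.0 * t)

spec = log_mel_spectrogram(signal, sr=sr)      # (n_frames, 40)
features = spec.mean(axis=0)                   # assumed pooling: average over time
weights, biases = init_mlp([40, 64, 32, 4])    # 4 hypothetical age ranges
probs = mlp_forward(features, weights, biases)
print(probs)
```

Because the weights are untrained, the printed probabilities are meaningless as predictions; the sketch only shows how the feature extraction and the network's forward pass fit together.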
Alternatives and similar repositories for Voice-to-Age:
Users who are interested in Voice-to-Age are comparing it to the repositories listed below.
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 Updated 2 years ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data ☆10 Updated 3 years ago
- A TensorFlow Implementation of Punctuation Restoration. ☆18 Updated 4 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model ☆24 Updated 4 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow ☆21 Updated 6 years ago
- Code for AccentDB. ☆20 Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆23 Updated 3 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201… ☆15 Updated 3 years ago
- VOiCES-subset ☆8 Updated 6 years ago
- Anonymous ICLR Submission ☆14 Updated 5 years ago
- some tutorials for blog: simonjisu.github.io ☆23 Updated 3 years ago
- creating audio preprocessing features in TensorFlow keras layers ☆14 Updated 3 years ago
- ☆12 Updated 3 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity ☆12 Updated 4 years ago
- GPT-jax based on the official huggingface library ☆13 Updated 3 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python ☆18 Updated last year
- The Hidden Markov Model Toolkit (HTK) ☆12 Updated 7 years ago
- Comprehensive Python library for speech and voice. ☆33 Updated 2 years ago
- Using Gradio interface to build UI for converting text to speech ☆12 Updated 4 years ago
- Dialect identification using Siamese network ☆15 Updated 7 years ago
- Segmenting a given document using recursive xy-cut algorithm. ☆12 Updated 6 years ago
- automatically align transcribed audio and generate a wav2letter training corpus ☆36 Updated last year
- bumble bee transformer ☆14 Updated 3 years ago
- ☆20 Updated 5 years ago
- A monster repo for random research, not organized in any particular way ☆13 Updated 8 years ago
- VertMetric: An abstractive summarization evaluation package. VERT stands for Versatile Evaluation of Reduced Texts. ☆11 Updated 6 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications. ☆47 Updated 8 years ago
- Representations of language in a model of visually grounded speech signal. ☆23 Updated 6 years ago
- Experiments with generating GPT-2 fanfiction on specified topics. ☆11 Updated 5 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers. ☆11 Updated 4 years ago