samemon / Voice-to-Age
This program estimates a person's age range from their voice. It uses a simple log-Mel spectrogram front end and a multi-layer perceptron with ReLU activations and a softmax output layer.
☆15 Updated 7 years ago
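The pipeline described above (log-Mel spectrogram features fed to a multi-layer perceptron with ReLU hidden layers and a softmax output) can be sketched roughly as follows. This is a minimal NumPy illustration, not the repository's actual code: the frame size, hop, number of Mel bands, layer sizes, the four age classes, the mean pooling over time, and all function names are assumptions made for the example, and the weights are random rather than trained.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_mels, n_fft, sr):
    """Triangular filters spaced evenly on the mel scale."""
    hz_pts = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        lo, ctr, hi = hz_pts[i], hz_pts[i + 1], hz_pts[i + 2]
        rising = (fft_freqs - lo) / (ctr - lo)
        falling = (hi - fft_freqs) / (hi - ctr)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel_spectrogram(signal, sr=16000, n_fft=512, hop=256, n_mels=40):
    """Return an array of shape (n_frames, n_mels) of log mel-band energies."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack([signal[t * hop : t * hop + n_fft] * window
                       for t in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    mel = power @ mel_filterbank(n_mels, n_fft, sr).T
    return np.log(mel + 1e-10)  # small offset avoids log(0)

def relu(x):
    return np.maximum(0.0, x)

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))  # shift for numerical stability
    return e / e.sum(axis=-1, keepdims=True)

def init_params(sizes, rng):
    """Random (untrained) weights; a real model would learn these."""
    return [(rng.normal(0.0, 0.1, (m, n)), np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def mlp_forward(x, params):
    """ReLU on every hidden layer, softmax on the final layer."""
    *hidden, (w_out, b_out) = params
    for w, b in hidden:
        x = relu(x @ w + b)
    return softmax(x @ w_out + b_out)

# Synthetic one-second 220 Hz tone standing in for a voice recording.
sr = 16000
signal = np.sin(2 * np.pi * 220 * np.arange(sr) / sr)

spec = log_mel_spectrogram(signal, sr=sr)
features = spec.mean(axis=0)  # assumption: average over time for a fixed-size input
params = init_params([40, 64, 32, 4], np.random.default_rng(0))  # 4 hypothetical age ranges
probs = mlp_forward(features, params)
print(probs)
```

The softmax output is a probability distribution over the assumed age ranges, so the predicted range would be `probs.argmax()` once the weights have been trained on labeled speech.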
Alternatives and similar repositories for Voice-to-Age
Users who are interested in Voice-to-Age are comparing it to the libraries listed below.
- Using embedding-based loss functions for phonetics/speech recognition. ☆17 Updated 10 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model ☆25 Updated 5 years ago
- Comprehensive Python library for speech and voice. ☆32 Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 Updated 2 years ago
- Automatic Speech Recognition Dataset Generation ☆37 Updated 6 years ago
- NLG Best Practices for Data-Efficient Modeling: How to Train Production-Ready Models with Little Data ☆10 Updated 3 years ago
- ☆15 Updated 4 years ago
- Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON ☆18 Updated 4 years ago
- ☆21 Updated 6 years ago
- ☆16 Updated 5 years ago
- ☆12 Updated 3 years ago
- A Text2Speech Engine built in PyTorch. ☆12 Updated 6 years ago
- A neural network, written in TensorFlow, for filtering a target speaker's voice from audio ☆21 Updated 7 years ago
- Experiments and tutorials with and for torchaudio ☆13 Updated 4 years ago
- bumble bee transformer ☆14 Updated 4 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications. ☆49 Updated 8 years ago
- Representations of language in a model of visually grounded speech signal. ☆23 Updated 7 years ago
- Conversational AI Benchmark. ☆68 Updated 2 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements". ☆14 Updated 4 years ago
- Unsupervised word segmentation and clustering of speech ☆13 Updated 8 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201… ☆15 Updated 4 years ago
- This sample includes a simple CNN classifier for music and an audio-folder dataloader, just like ImageFolder in torchvision. ☆11 Updated 6 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes). ☆30 Updated last year
- Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages ☆9 Updated 2 years ago
- Experiments with Hugging Face 🔬 🤗 ☆44 Updated 10 months ago
- Dataset Release for Intent Classification from Speech ☆47 Updated 4 months ago
- Dialect identification using Siamese network ☆15 Updated 7 years ago
- Experiments for the blog post "No, We Don't Have to Choose Batch Sizes As Powers Of 2" ☆20 Updated 3 years ago
- A TensorFlow Implementation of Punctuation Restoration. ☆18 Updated 4 years ago
- Anonymous ICLR Submission ☆14 Updated 5 years ago