samemon / Voice-to-AgeLinks
This program determines the age range of a person from their voice. It uses a simple Mel-log spectrogram approach with a multi-layer perceptron model with relu as an activation and softmax in the final layer.
☆15Updated 8 years ago
Alternatives and similar repositories for Voice-to-Age
Users that are interested in Voice-to-Age are comparing it to the libraries listed below
Sorting:
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Updated 8 years ago
- Conversational AI Benchmark.☆68Updated 2 years ago
- Speaker Diarization is the first step in many early audio processing and aims to solve the problem ”who spoke when”. It therefore relies …☆12Updated 7 years ago
- my approach to the kaggle speech recognition challenge☆25Updated 7 years ago
- ABX and kaldi experiments on speech corpora made easy☆33Updated last year
- Unsupervised word segmentation and clustering of speech☆13Updated 8 years ago
- Open Source Speech Inferencing Libary for Indic Languages☆13Updated 3 years ago
- A collection of basic python modules for spoken natural language processing☆55Updated 6 years ago
- ☆27Updated 6 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Automatic Speech Recognition Dataset Generation☆37Updated 7 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 7 years ago
- Urban Sound Classification : striving towards a fair comparison☆17Updated 5 years ago
- Stats 479 Project☆22Updated 6 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- Unsupervised Speaker Clustering & Speaker Recognition☆13Updated 6 years ago
- readers that enable reading kaldi ark in tensorflow☆17Updated 7 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 3 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- Multi-lingual Text Processing☆96Updated 6 years ago
- ☆33Updated 6 years ago
- An LSTM RNN for restoring missing punctuation in unsegmented text.☆78Updated 9 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 3 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Updated 3 years ago
- Language Model Fine-tuning for Moby Dick☆42Updated 6 years ago
- Collection of models and extensions for deployment in PyTorch☆24Updated 3 years ago
- Comprehensive Python library for speech and voice.☆32Updated 3 years ago
- Normalize text string☆12Updated 7 years ago
- Speech-to-text based on wav2letter built for transfer learning☆98Updated 3 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 6 years ago