samemon / Voice-to-Age
This program estimates a person's age range from their voice. It feeds log-Mel spectrogram features to a multi-layer perceptron with ReLU activations and a softmax output layer.
☆15Updated 7 years ago
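The description above names a ReLU multi-layer perceptron with a softmax output. As a rough illustration only, the forward pass of such a classifier might look like the sketch below. It is not the repository's code: the layer sizes, the number of age buckets, the random weights, and the helper names (`dense`, `init_layer`, `predict_age_bucket`) are all hypothetical, and the input vector is a random stand-in for real log-Mel features.

```python
import math
import random


def relu(values):
    """Element-wise ReLU: negative activations become zero."""
    return [max(0.0, v) for v in values]


def softmax(logits):
    """Numerically stable softmax: subtract the max before exponentiating."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def dense(inputs, weights, biases):
    """Fully connected layer; `weights` holds one row per output unit."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def init_layer(n_in, n_out, rng):
    """Random, untrained weights (placeholder for a trained model)."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def predict_age_bucket(features, layers):
    """ReLU on every hidden layer, softmax on the final layer."""
    activations = features
    for weights, biases in layers[:-1]:
        activations = relu(dense(activations, weights, biases))
    weights, biases = layers[-1]
    return softmax(dense(activations, weights, biases))


rng = random.Random(0)
N_FEATURES = 40  # assumption: e.g. 40 log-Mel bands averaged over time
N_HIDDEN = 16    # assumption: one small hidden layer
N_BUCKETS = 4    # assumption: four age ranges

layers = [init_layer(N_FEATURES, N_HIDDEN, rng),
          init_layer(N_HIDDEN, N_BUCKETS, rng)]
features = [rng.gauss(0.0, 1.0) for _ in range(N_FEATURES)]  # stand-in input

probs = predict_age_bucket(features, layers)
print(probs)  # one probability per hypothetical age bucket
```

With untrained weights the probabilities are meaningless; the point is only the shape of the computation, where the softmax turns the final layer's scores into a distribution over age ranges.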
Alternatives and similar repositories for Voice-to-Age
Users who are interested in Voice-to-Age are comparing it to the libraries listed below
- Conversational AI Benchmark.☆68Updated 2 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Updated 8 years ago
- Unsupervised word segmentation and clustering of speech☆13Updated 8 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- Automatic Speech Recognition Dataset Generation☆37Updated 7 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- ☆76Updated 3 years ago
- The implementation of 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 7 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 4 years ago
- Urban Sound Classification : striving towards a fair comparison☆17Updated 4 years ago
- Code for AccentDB.☆23Updated 4 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- 🔉 A web app to play, visualize, and annotate your audio files for machine learning☆120Updated 5 years ago
- Speech-to-text based on wav2letter built for transfer learning☆98Updated 2 years ago
- Learning embeddings for laughter categorization☆34Updated 6 years ago
- Advanced data structures for handling temporal segments with attached labels.☆118Updated last week
- Kaldi-style neural network training in PyTorch for use in place of nnet3 in Kaldi.☆26Updated last year
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- A collection of basic Python modules for spoken natural language processing☆55Updated 5 years ago
- Automatically align transcribed audio and generate a wav2letter training corpus☆36Updated 2 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago
- Speaker Diarization is the first step in many early audio processing and aims to solve the problem "who spoke when". It therefore relies …☆12Updated 6 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 5 years ago
- Dataset Release for Intent Classification from Speech☆47Updated 6 months ago
- Unsupervised Speaker Clustering & Speaker Recognition☆13Updated 6 years ago
- ABX and Kaldi experiments on speech corpora made easy☆33Updated 11 months ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆156Updated 5 years ago