mxochicale / htk
The Hidden Markov Model Toolkit (HTK)
☆12, updated 7 years ago
Alternatives and similar repositories for htk:
Users who are interested in htk are comparing it to the repositories listed below:
- Implementation of Differential Learning Rate in Keras ☆11, updated 5 years ago
- PyTorch implementations of neural network models for keyword spotting ☆11, updated 4 years ago
- Simple face alignment library using face_recognition and OpenCV ☆16, updated 5 years ago
- This program determines the age range of a person from their voice. It uses a simple Mel-log spectrogram approach with a multi-layer perc… ☆15, updated 7 years ago
- Python tools for feature extraction and dimensionality reduction on image data for articulatory phonetics. ☆7, updated 5 years ago
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016) ☆14, updated 7 years ago
- MnasNet with PyTorch and NCNN ☆9, updated 6 years ago
- The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi… ☆29, updated last year
- A Text2Speech Engine built in PyTorch. ☆11, updated 6 years ago
- Speech command recognition with DenseNet transfer learning from UrbanSound8k in Keras/TensorFlow ☆17, updated 7 years ago
- Implementation in Keras of Effnet (https://arxiv.org/abs/1801.06434) ☆21, updated 6 years ago
- LogMMSE speech enhancement/noise reduction ☆30, updated 4 years ago
- Implementation of the joint Bayesian model, written in Python. ☆11, updated 3 years ago
- Dialect identification using a Siamese network ☆15, updated 7 years ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese ☆17, updated 5 years ago
- ☆27, updated 5 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201… ☆15, updated 3 years ago
- Video classification tools using 3D ResNet ☆22, updated 7 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec… ☆43, updated last year
- Voice Conversion using Tacotron. ☆11, updated 2 years ago
- Audio command recognition by DTW and classification ☆7, updated 4 years ago
- Universal Python binding for the LMDB 'Lightning' Database ☆13, updated 7 years ago
- A very naive and simple benchmark between dlib and pytorch in terms of space and time ☆19, updated 4 years ago
- MirasVoice is a data set consisting of speech samples from bilinguals to train neural network for optimization of speaker verification algor… ☆17, updated 4 years ago
- Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON ☆18, updated 3 years ago
- Python scripts to facilitate easy working ☆11, updated 7 months ago
- Automated Lip Reading using Deep Reinforcement Learning ☆30, updated 6 years ago
- Compute useful transcription metrics (CER, WER, SER, ...) ☆27, updated 10 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox". ☆27, updated 3 years ago
- Three experiments for data-efficient video transformers. ☆9, updated 2 years ago