mxochicale / htkLinks
The Hidden Markov Model Toolkit (HTK)
☆14Updated 8 years ago
Alternatives and similar repositories for htk
Users that are interested in htk are comparing it to the libraries listed below
Sorting:
- Implementation of Differential Learning Rate in Keras☆11Updated 6 years ago
- PyTorch implementations of neural network models for keyword spotting☆11Updated 5 years ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆19Updated 5 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆74Updated 6 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- Comprehensive Python library for speech and voice.☆32Updated 3 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Updated 9 years ago
- Pytorch Code for S2IGAN☆41Updated 5 years ago
- Speaker recognition ,Voiceprint recognition☆53Updated 6 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆34Updated 7 years ago
- Simple face alignment library by using face_recognition and opencv☆16Updated 6 years ago
- Web-based tool for straight-forward class annotation of audio files☆11Updated 5 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 6 years ago
- A Text-To-Speech Model Developed Using 🐸STT☆12Updated 3 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated 2 years ago
- Sequence Modelling with CTC☆52Updated 3 years ago
- end-to-end voicebot that answers open domain questions.☆10Updated 4 years ago
- Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in…☆48Updated 2 years ago
- An end to end ASR Transformer model training repo☆13Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Updated 2 years ago
- Automatic Speech Recognition Dataset Generation☆37Updated 7 years ago
- Best Collection of Articles and code for Audio Classification☆15Updated 6 years ago
- Implementation in Keras of Effnet (https://arxiv.org/abs/1801.06434)☆21Updated 7 years ago
- End to End Multiview Lip Reading☆10Updated 8 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆83Updated 8 years ago
- Deploy DL/ ML inference pipelines with minimal extra code.☆102Updated last year
- A deep learning model for classifying audio frames into [SPEECH, KCHI, CHI, MAL, FEM] classes.☆49Updated 2 months ago
- Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library☆212Updated 5 years ago
- A simple audio feature extraction library☆81Updated 6 years ago
- Tensorflow Implementation of FaceNet: A Unified Embedding for Face Recognition and Clustering to find the celebrity whose face matches th…☆31Updated 3 years ago