mxochicale / htk
The Hidden Markov Model Toolkit (HTK)
☆12Updated 7 years ago
Alternatives and similar repositories for htk:
Users that are interested in htk are comparing it to the libraries listed below
- PyTorch implementations of neural network models for keyword spotting☆11Updated 4 years ago
- Implementation of Differential Learning Rate in Keras☆11Updated 5 years ago
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016)☆14Updated 8 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆27Updated 3 years ago
- A very naive and simple benchmark between dlib and pytorch in terms of space and time☆19Updated 4 years ago
- ☆12Updated 3 years ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆17Updated 5 years ago
- ☆24Updated 5 years ago
- Python tools for feature extraction and dimensionality reduction on image data for articulatory phonetics.☆7Updated 5 years ago
- Collection of models and extensions for deployment in PyTorch☆24Updated 2 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆43Updated 4 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆34Updated 6 years ago
- ⏩ Generating speech in a single forward pass without any attention!☆12Updated 3 years ago
- creating audio preprocessing features in TensorFlow keras layers,☆14Updated 3 years ago
- Simple speech recognition using dynamic time warping with examples☆29Updated 5 years ago
- Audio data augmentation examples☆34Updated 6 years ago
- DOneLogin Android: Facial verification for Two-Factors Authentication (2FA) on Android platform☆11Updated 4 years ago
- Voice Conversion using Tacotron.☆11Updated 2 years ago
- Implementation in Keras of Effnet (https://arxiv.org/abs/1801.06434)☆21Updated 6 years ago
- bumble bee transformer☆14Updated 3 years ago
- Universal Python binding for the LMDB 'Lightning' Database☆13Updated 7 years ago
- Experiments and tutorials with and for torchaudio☆13Updated 3 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆17Updated 5 years ago
- The History of Speech Recognition to the Year 2030☆12Updated 3 years ago
- A database of clean and noisy speech for audio research☆10Updated 7 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- Three experiments for data efficient video transformers.☆9Updated 3 years ago