open-speech / kaldi-io
C++ Kaldi I/O library (static and dynamic).
☆25Updated 6 years ago
Alternatives and similar repositories for kaldi-io
Users that are interested in kaldi-io are comparing it to the libraries listed below
- TensorFlow speech synthesis C++ inference for voicenet☆16Updated 6 years ago
- ☆48Updated 4 years ago
- Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).☆62Updated 9 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files☆53Updated 2 months ago
- ☆21Updated 5 years ago
- ☆41Updated 7 years ago
- Custom decoders for Kaldi☆80Updated 6 years ago
- Python package implementing the TD-PSOLA algorithm for speech processing☆43Updated 8 years ago
- ☆76Updated 3 years ago
- ☆34Updated 6 years ago
- A modified version of Speech Signal Processing Toolkit (SPTK)☆89Updated 3 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 6 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Updated 7 years ago
- An ASR decoder and make-graph project☆33Updated 3 years ago
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Updated 6 years ago
- ChiNese Text Normalization (CNTN) tool for text-to-speech systems☆36Updated 7 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Updated 6 years ago
- A C++ library of "World" - A high-quality speech analysis, manipulation and synthesis system -☆60Updated 6 years ago
- Long audio alignment using Kaldi☆23Updated 4 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆19Updated 6 years ago
- Script for converting kaldi GMM/HMM models to HTK format☆11Updated last year
- Tacotron text to speech in C++(synthesize only)☆77Updated 6 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 5 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Updated 6 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 8 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 6 years ago
- working on parallel wavenet☆25Updated 7 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Updated 7 years ago
- ☆61Updated 2 years ago
- VAD wrapper around webrtcvad☆24Updated 8 years ago