open-speech / kaldi-io
c++ Kaldi IO lib (static and dynamic).
☆25Updated 6 years ago
Alternatives and similar repositories for kaldi-io:
Users that are interested in kaldi-io are comparing it to the libraries listed below
- tensorflow speech synthesis c++ inference for voicenet☆16Updated 5 years ago
- ☆48Updated 4 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆37Updated last year
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Updated 5 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆47Updated 7 months ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Updated 5 years ago
- WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)☆20Updated 6 years ago
- Custom decoders for Kaldi☆79Updated 5 years ago
- ☆34Updated 5 years ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆35Updated 6 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 3 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- Utilities for resampling and filtering audio data☆46Updated 5 years ago
- A pitch tracker inspired by David Talkin's RAPT (Robust Algorithm for Pitch Tracking) written in Python.☆47Updated 8 years ago
- ☆20Updated 5 years ago
- ☆42Updated 6 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Updated 5 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Updated 4 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- ☆24Updated 4 years ago
- ☆41Updated 6 years ago
- Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).☆61Updated 9 years ago
- Pulse Model vocoder☆42Updated 6 years ago
- A C++ library of "World" - A high-quality speech analysis, manipulation and synthesis system -☆55Updated 5 years ago
- A pytorch implementation of FFTNet.☆36Updated 6 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Updated 2 weeks ago
- ☆15Updated 5 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago