open-speech / kaldi-io
C++ Kaldi IO library (static and dynamic).
☆25Updated 6 years ago
Alternatives and similar repositories for kaldi-io
Users who are interested in kaldi-io are comparing it to the libraries listed below.
- tensorflow speech synthesis c++ inference for voicenet☆16Updated 6 years ago
- ☆48Updated 4 years ago
- ☆20Updated 5 years ago
- Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).☆62Updated 9 years ago
- ☆41Updated 7 years ago
- Custom decoders for Kaldi☆79Updated 6 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆19Updated 6 years ago
- A C++ library of "World" - A high-quality speech analysis, manipulation and synthesis system -☆60Updated 6 years ago
- A modified version of Speech Signal Processing Toolkit (SPTK)☆89Updated 3 years ago
- Long audio alignment using Kaldi☆23Updated 4 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files☆53Updated 3 months ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 5 years ago
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Updated 5 years ago
- C++ implementation of end-to-end TTS which combines both Tacotron2 and the LPCNET vocoder.☆32Updated 5 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated last year
- Python package implementing the TD-PSOLA algorithm for speech processing☆42Updated 7 years ago
- ☆15Updated 5 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Updated 6 years ago
- ☆76Updated 3 years ago
- A KALDI/C++ implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Updated 5 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆37Updated 2 years ago
- A pitch tracker inspired by David Talkin's RAPT (Robust Algorithm for Pitch Tracking) written in Python.☆48Updated 9 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Updated 5 years ago
- Bayesian spEEch Recognizer☆55Updated 4 years ago
- Tacotron text to speech in C++ (synthesis only)☆76Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆36Updated 7 years ago
- WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)☆20Updated 6 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago
- ☆34Updated 6 years ago