r9y9 / pysptkLinks
A python wrapper for Speech Signal Processing Toolkit (SPTK).
☆444Updated last year
Alternatives and similar repositories for pysptk
Users that are interested in pysptk are comparing it to the libraries listed below
Sorting:
- A Python wrapper for the high-quality vocoder "World"☆760Updated 5 months ago
- WaveNet-Vocoder implementation with pytorch.☆300Updated 5 years ago
- Library to build speech synthesis systems designed for easy and fast prototyping.☆398Updated last year
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆377Updated 2 years ago
- PyTorch implementation of Tacotron speech synthesis model.☆310Updated 6 years ago
- ☆152Updated last year
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆368Updated 2 years ago
- Speech Enhancement Generative Adversarial Network in PyTorch☆396Updated last year
- Mel cepstral distortion (MCD) computations in python.☆224Updated 8 years ago
- Python implementation of the Short Term Objective Intelligibility measure☆344Updated last year
- A vocoder framework which had been widely used in research community since 1999.☆181Updated 6 years ago
- Different implementations of "Weighted Prediction Error" for speech dereverberation☆524Updated 3 months ago
- An STFT/iSTFT for PyTorch.☆361Updated last year
- A pure python module for reading and writing kaldi ark files☆259Updated 4 months ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆331Updated last year
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆371Updated 11 months ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆367Updated 6 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆239Updated 4 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆513Updated 3 years ago
- Tools for Speech Enhancement integrated with Kaldi☆416Updated 2 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆475Updated 5 years ago
- A WaveRNN implementation☆200Updated 5 years ago
- The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.☆309Updated 3 years ago
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.☆310Updated 3 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆317Updated 4 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆282Updated last year
- An open source dataset for source separation☆432Updated last year
- A library for speech data augmentation in time-domain☆666Updated 3 years ago
- A statistical model-based Voice Activity Detection☆192Updated 6 years ago
- ESPnet Model Zoo☆255Updated 2 years ago