r9y9 / pysptkLinks
A python wrapper for Speech Signal Processing Toolkit (SPTK).
☆447Updated last year
Alternatives and similar repositories for pysptk
Users that are interested in pysptk are comparing it to the libraries listed below
Sorting:
- A Python wrapper for the high-quality vocoder "World"☆768Updated 9 months ago
- WaveNet-Vocoder implementation with pytorch.☆300Updated 5 years ago
- ☆154Updated last year
- Python implementation of the Short Term Objective Intelligibility measure☆351Updated last year
- PyTorch implementation of Tacotron speech synthesis model.☆309Updated 6 years ago
- Different implementations of "Weighted Prediction Error" for speech dereverberation☆536Updated 7 months ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Updated 4 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆377Updated 2 years ago
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆368Updated 2 years ago
- Mel cepstral distortion (MCD) computations in python.☆227Updated 8 years ago
- Library to build speech synthesis systems designed for easy and fast prototyping.☆399Updated last year
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.☆309Updated 3 years ago
- A vocoder framework which had been widely used in research community since 1999.☆181Updated 6 years ago
- Tools for Speech Enhancement integrated with Kaldi☆423Updated 2 years ago
- Speech Denoising with Deep Feature Losses☆189Updated 5 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆518Updated 3 years ago
- An open source dataset for source separation☆450Updated last year
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆375Updated last year
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆366Updated 6 years ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆338Updated last year
- The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.☆312Updated 3 years ago
- Speech Enhancement Generative Adversarial Network in PyTorch☆402Updated 2 years ago
- An STFT/iSTFT for PyTorch.☆366Updated last year
- A pure python module for reading and writing kaldi ark files☆263Updated 7 months ago
- A statistical model-based Voice Activity Detection☆194Updated 6 years ago
- A library for speech data augmentation in time-domain☆676Updated 4 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆229Updated last month
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆319Updated 4 years ago
- A WaveRNN implementation☆200Updated 6 years ago
- A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS☆232Updated 5 years ago