r9y9 / pysptkLinks
A python wrapper for Speech Signal Processing Toolkit (SPTK).
☆447Updated last year
Alternatives and similar repositories for pysptk
Users that are interested in pysptk are comparing it to the libraries listed below
Sorting:
- A Python wrapper for the high-quality vocoder "World"☆770Updated 9 months ago
- WaveNet-Vocoder implementation with pytorch.☆300Updated 5 years ago
- ☆154Updated last year
- Library to build speech synthesis systems designed for easy and fast prototyping.☆399Updated last year
- Python implementation of the Short Term Objective Intelligibility measure☆351Updated last year
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆338Updated last year
- Different implementations of "Weighted Prediction Error" for speech dereverberation☆541Updated 7 months ago
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆369Updated 2 years ago
- Speech Enhancement Generative Adversarial Network in PyTorch☆406Updated 2 years ago
- A vocoder framework which had been widely used in research community since 1999.☆181Updated 6 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Updated 2 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆366Updated 6 years ago
- Tools for Speech Enhancement integrated with Kaldi☆423Updated 2 years ago
- Mel cepstral distortion (MCD) computations in python.☆227Updated 8 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Updated 5 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆376Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆473Updated 5 years ago
- Speech Denoising with Deep Feature Losses☆189Updated 5 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆518Updated 3 years ago
- A library for speech data augmentation in time-domain☆677Updated 4 years ago
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.☆309Updated 3 years ago
- PyTorch implementation of Tacotron speech synthesis model.☆308Updated 6 years ago
- An STFT/iSTFT for PyTorch.☆366Updated 2 years ago
- A pure python module for reading and writing kaldi ark files☆267Updated 8 months ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆228Updated 4 years ago
- An open source dataset for source separation☆450Updated last year
- A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS☆232Updated 5 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆287Updated last year
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat☆296Updated 2 years ago
- Implementation of the Griffin and Lim algorithm to recover an audio signal from a magnitude-only spectrogram.☆176Updated 7 years ago