r9y9 / pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
☆441Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for pysptk
- A Python wrapper for the high-quality vocoder "World"☆725Updated last year
- WaveNet-Vocoder implementation with pytorch.☆297Updated 4 years ago
- Library to build speech synthesis systems designed for easy and fast prototyping.☆393Updated 4 months ago
- ☆149Updated 11 months ago
- Different implementations of "Weighted Prediction Error" for speech dereverberation☆494Updated 2 months ago
- Python implementation of the Short Term Objective Intelligibility measure☆327Updated 10 months ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated last year
- Tools for Speech Enhancement integrated with Kaldi☆399Updated last year
- Mel cepstral distortion (MCD) computations in python.☆213Updated 7 years ago
- A suite of speech signal processing tools☆226Updated this week
- A library for speech data augmentation in time-domain☆647Updated 3 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆346Updated 4 months ago
- A vocoder framework which had been widely used in research community since 1999.☆176Updated 5 years ago
- Speech Enhancement Generative Adversarial Network in PyTorch☆379Updated last year
- A pure python module for reading and writing kaldi ark files☆249Updated last year
- A statistical model-based Voice Activity Detection☆190Updated 5 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆237Updated 4 years ago
- PyTorch implementation of Tacotron speech synthesis model.☆309Updated 5 years ago
- Speech Denoising with Deep Feature Losses☆185Updated 4 years ago
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems☆254Updated last year
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆362Updated last year
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.☆306Updated 2 years ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆313Updated 11 months ago
- An open source dataset for source separation☆380Updated 9 months ago
- Deep neural network based speech enhancement toolkit☆211Updated 5 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆355Updated last year
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆187Updated last year
- End-to-End Neural Diarization☆376Updated 3 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆273Updated 10 months ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆908Updated last year