r9y9 / pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
☆441Updated 10 months ago
Alternatives and similar repositories for pysptk
Users that are interested in pysptk are comparing it to the libraries listed below
Sorting:
- A Python wrapper for the high-quality vocoder "World"☆749Updated 3 months ago
- Library to build speech synthesis systems designed for easy and fast prototyping.☆397Updated 10 months ago
- WaveNet-Vocoder implementation with pytorch.☆298Updated 4 years ago
- Python implementation of the Short Term Objective Intelligibility measure☆339Updated last year
- ☆152Updated last year
- Speaker embedding (d-vector) trained with GE2E loss☆282Updated last year
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated last year
- Tools for Speech Enhancement integrated with Kaldi☆413Updated last year
- A pure python module for reading and writing kaldi ark files☆258Updated 2 months ago
- A vocoder framework which had been widely used in research community since 1999.☆180Updated 6 years ago
- A library for speech data augmentation in time-domain☆659Updated 3 years ago
- PyTorch implementation of Tacotron speech synthesis model.☆310Updated 5 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆239Updated 4 years ago
- Mel cepstral distortion (MCD) computations in python.☆223Updated 7 years ago
- Speech Enhancement Generative Adversarial Network in PyTorch☆393Updated last year
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆329Updated last year
- Deep neural network based speech enhancement toolkit☆216Updated 5 years ago
- End-to-End Neural Diarization☆401Updated 3 years ago
- A suite of speech signal processing tools☆233Updated this week
- An open source dataset for source separation☆419Updated last year
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆206Updated 2 months ago
- Different implementations of "Weighted Prediction Error" for speech dereverberation☆519Updated last month
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆366Updated 2 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆315Updated 4 years ago
- see README☆342Updated 9 months ago
- End-2-end speech synthesis with recurrent neural networks☆226Updated last year
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.☆310Updated 3 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆366Updated 9 months ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆367Updated 6 years ago
- A statistical model-based Voice Activity Detection☆192Updated 6 years ago