r9y9/pysptk

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/r9y9/pysptk)

r9y9 / pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

☆451

Alternatives and similar repositories for pysptk

Users that are interested in pysptk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JeremyCCHsu / Python-Wrapper-for-World-Vocoder
View on GitHub
A Python wrapper for the high-quality vocoder "World"
☆790Jan 21, 2025Updated last year
r9y9 / nnmnkwii
View on GitHub
Library to build speech synthesis systems designed for easy and fast prototyping.
☆399Jun 29, 2024Updated 2 years ago
r9y9 / gantts
View on GitHub
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
☆518Nov 1, 2020Updated 5 years ago
descriptinc / cargan
View on GitHub
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
☆193Dec 8, 2022Updated 3 years ago
mmorise / World
View on GitHub
A high-quality speech analysis, manipulation and synthesis system
☆1,332Feb 18, 2026Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sp-nitech / SPTK
View on GitHub
A suite of speech signal processing tools
☆247Jul 14, 2026Updated 2 weeks ago
kan-bayashi / ParallelWaveGAN
View on GitHub
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,646Apr 22, 2024Updated 2 years ago
r9y9 / SPTK
View on GitHub
A modified version of Speech Signal Processing Toolkit (SPTK)
☆89Jun 5, 2022Updated 4 years ago
lochenchou / MOSNet
View on GitHub
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
☆379Jul 21, 2024Updated 2 years ago
k2kobayashi / sprocket
View on GitHub
Voice Conversion Tool Kit
☆608Feb 27, 2023Updated 3 years ago
NVIDIA / mellotron
View on GitHub
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆869Jul 22, 2023Updated 3 years ago
bigpon / vcc20_baseline_cyclevae
View on GitHub
Voice Conversion Challenge 2020 CycleVAE baseline system
☆131Oct 19, 2020Updated 5 years ago
descriptinc / melgan-neurips
View on GitHub
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
☆1,040Aug 28, 2023Updated 2 years ago
auspicious3000 / SpeechSplit
View on GitHub
Unsupervised Speech Decomposition Via Triple Information Bottleneck
☆697Oct 23, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
bshall / UniversalVocoding
View on GitHub
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238Nov 14, 2020Updated 5 years ago
sp-nitech / diffsptk
View on GitHub
A differentiable version of SPTK
☆201Jul 14, 2026Updated 2 weeks ago
bshall / ZeroSpeech
View on GitHub
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
☆339Jul 6, 2023Updated 3 years ago
dipjyoti92 / SC-WaveRNN
View on GitHub
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110Jun 22, 2022Updated 4 years ago
kan-bayashi / PytorchWaveNetVocoder
View on GitHub
WaveNet-Vocoder implementation with pytorch.
☆301Jun 8, 2020Updated 6 years ago
r9y9 / pyreaper
View on GitHub
A python wrapper for REAPER
☆81Jan 22, 2025Updated last year
jxzhanggg / nonparaSeq2seqVC_code
View on GitHub
Implementation code of non-parallel sequence-to-sequence VC
☆248Mar 24, 2023Updated 3 years ago
auspicious3000 / autovc
View on GitHub
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
☆1,100Oct 23, 2024Updated last year
ljuvela / GELP
View on GitHub
☆27Apr 21, 2021Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
xiph / LPCNet
View on GitHub
Efficient neural speech synthesis
☆1,219Sep 21, 2024Updated last year
k2kobayashi / crank
View on GitHub
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
☆171Jul 25, 2024Updated 2 years ago
liusongxiang / StarGAN-Voice-Conversion
View on GitHub
This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial ne…
☆523Oct 11, 2019Updated 6 years ago
MattShannon / mcd
View on GitHub
Mel cepstral distortion (MCD) computations in python.
☆231Jun 13, 2017Updated 9 years ago
aliutkus / speechmetrics
View on GitHub
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆1,050Jul 5, 2023Updated 3 years ago
geneing / WaveRNN-Pytorch
View on GitHub
Fatcord's Alternative WaveRNN (Faster training)
☆132Nov 29, 2020Updated 5 years ago
fatchord / WaveRNN
View on GitHub
WaveRNN Vocoder + TTS
☆2,188Jul 2, 2022Updated 4 years ago
r9y9 / nnmnkwii_gallery
View on GitHub
A collection of examples demonstrating how we can build speech synthesis systems using nnmnkwii.
☆70May 15, 2020Updated 6 years ago
seungwonpark / melgan
View on GitHub
MelGAN vocoder (compatible with NVIDIA/tacotron2)
☆650Oct 3, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
mkotha / WaveRNN
View on GitHub
A WaveRNN implementation
☆201Oct 14, 2019Updated 6 years ago
google / REAPER
View on GitHub
☆412Nov 30, 2021Updated 4 years ago
facebookresearch / vocoder-benchmark
View on GitHub
A repository for benchmarking neural vocoders by their quality and speed.
☆213May 30, 2025Updated last year
tts-tutorial / survey
View on GitHub
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
☆371Nov 5, 2021Updated 4 years ago
mairaksi / PiENet
View on GitHub
Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals
☆50Jul 24, 2019Updated 7 years ago
joansj / blow
View on GitHub
Code to train and run Blow
☆145Sep 4, 2019Updated 6 years ago
CSTR-Edinburgh / merlin
View on GitHub
This is now the official location of the Merlin project.
☆1,320Mar 3, 2020Updated 6 years ago