ttaoREtw/semi-tts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ttaoREtw/semi-tts)

ttaoREtw / semi-tts

Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation

☆39

Alternatives and similar repositories for semi-tts

Users that are interested in semi-tts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xushengyuan / FastSing2
View on GitHub
An imporved version of Fastsinging singing voice synthesising system.
☆21Nov 3, 2020Updated 5 years ago
ljuvela / GELP
View on GitHub
☆27Apr 21, 2021Updated 5 years ago
bigpon / QPPWG
View on GitHub
Quasi-Periodic Parallel WaveGAN Pytorch implementation
☆46Oct 29, 2022Updated 3 years ago
dipjyoti92 / SC-WaveRNN
View on GitHub
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110Jun 22, 2022Updated 4 years ago
andi611 / ZeroSpeech-TTS-without-T
View on GitHub
A Pytorch implementation for the ZeroSpeech 2019 challenge.
☆112Nov 12, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Deepest-Project / AlignTTS
View on GitHub
Implementation of the AlignTTS
☆77Jul 6, 2023Updated 3 years ago
nii-yamagishilab / Extended_VQVAE
View on GitHub
☆64Aug 14, 2023Updated 2 years ago
bshall / VectorQuantizedCPC
View on GitHub
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
☆142Sep 1, 2020Updated 5 years ago
Sytronik / deep-griffinlim-iteration
View on GitHub
PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)
☆39Oct 12, 2019Updated 6 years ago
babe269 / performant
View on GitHub
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33Sep 2, 2022Updated 3 years ago
jayneelparekh / sp2si-code
View on GitHub
Contains code for our work on speech to singing conversion (ICASSP 2020)
☆50Oct 27, 2020Updated 5 years ago
sarulab-speech / multi-speaker-dgp
View on GitHub
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Mar 23, 2021Updated 5 years ago
yanggeng1995 / WaveRNN
View on GitHub
☆34Jul 16, 2019Updated 7 years ago
L0SG / NanoFlow
View on GitHub
PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)
☆67Dec 28, 2020Updated 5 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
geneing / WaveRNN-Pytorch
View on GitHub
Fatcord's Alternative WaveRNN (Faster training)
☆132Nov 29, 2020Updated 5 years ago
bshall / UniversalVocoding
View on GitHub
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238Nov 14, 2020Updated 5 years ago
cyhuang-tw / AdaIN-VC
View on GitHub
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…
☆119May 27, 2021Updated 5 years ago
zceng / LVCNet
View on GitHub
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
☆80Feb 24, 2021Updated 5 years ago
helenacuesta / multif0-estimation-polyvocals
View on GitHub
Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"
☆55Nov 20, 2024Updated last year
tarepan / VoiceConversionLab
View on GitHub
Collect Voice Conversion researches
☆97Updated this week
tzuhsien / Voice-conversion-evaluation
View on GitHub
An evaluation toolkit for voice conversion models.
☆42Jul 11, 2021Updated 5 years ago
AppleHolic / multiband_melgan
View on GitHub
An unofficial implementation of https://arxiv.org/abs/2005.05106
☆50Mar 10, 2021Updated 5 years ago
ex3ndr / supervoice-gpt
View on GitHub
GPT-style network for phonemization with durations of text
☆68Mar 21, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
yanggeng1995 / Multi-band-WaveRNN
View on GitHub
☆45Dec 16, 2019Updated 6 years ago
hrbigelow / ae-wavenet
View on GitHub
Wavenet Autoencoder for Unsupervised speech representation learning (after Chorowski, Jan 2019)
☆176Sep 16, 2020Updated 5 years ago
thuhcsi / VAENAR-TTS
View on GitHub
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
☆144Jul 8, 2021Updated 5 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
View on GitHub
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37Oct 28, 2019Updated 6 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
liusongxiang / efficient_tts
View on GitHub
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
☆116Dec 22, 2021Updated 4 years ago
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
shang0712 / HierTTS
View on GitHub
☆47Apr 16, 2023Updated 3 years ago
ivanvovk / compressed-tacotron2-pytorch
View on GitHub
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22Dec 26, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ivanvovk / durian-pytorch
View on GitHub
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
☆184Aug 12, 2020Updated 5 years ago
ANLGBOY / WaveNODE
View on GitHub
Pytorch Implementation of WaveNODE
☆64Sep 4, 2020Updated 5 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
View on GitHub
☆16Apr 4, 2022Updated 4 years ago
LeoniusChen / Attentions-in-Tacotron
View on GitHub
☆69Mar 31, 2021Updated 5 years ago
nii-yamagishilab / multi-speaker-tacotron
View on GitHub
VCTK multi-speaker tacotron for ICASSP 2020
☆266Mar 29, 2022Updated 4 years ago
cyhuang-tw / AutoVC
View on GitHub
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
☆34Apr 26, 2021Updated 5 years ago
MiscellaneousStuff / PhoneLM
View on GitHub
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48Sep 4, 2023Updated 2 years ago