lexkoro / StyleTTSLinks

☆11

Alternatives and similar repositories for StyleTTS

Users that are interested in StyleTTS are comparing it to the libraries listed below

Sorting:

audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Updated last year
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Updated 3 months ago
b-sigpro / sed-hsmm
Onset-and-Offset-Aware Sound Event Detection
☆17Updated 5 months ago
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆20Updated last month
p1an-lin-jung / wv_tts
☆19Updated last year
shengcanxu / canoSpeech
text to speech
☆10Updated last year
amphionspace / tts-evaluation
An evaluation set for large-scale trained TTS models (Coming in Sep 2024)
☆12Updated 10 months ago
v-nhandt21 / MusicVoiceConversion
Sing any popular song with your voice
☆11Updated 3 years ago
idiap / zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆21Updated last year
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18Updated last year
lexkoro / cfm-vc
☆11Updated 4 months ago
reppy4620 / x-vits
☆13Updated 8 months ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Updated 11 months ago
uthree / ddsp-vocoder
☆10Updated 8 months ago
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆21Updated last week
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
☆16Updated 3 years ago
mcf330 / efts2code
source code of EfficientTTS 2
☆14Updated last year
KdaiP / conformer-RoPE
Conformer block with Rotary Position Embedding, modified from lucidrains' implement
☆14Updated 10 months ago
speechnovateur / languagecodec_tmp
Temporary anonymous version
☆22Updated last year
sarulab-speech / spatial_voice_conversion
Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
☆17Updated 11 months ago
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17Updated 9 months ago
thuhcsi / PortableTTS
☆12Updated 2 years ago
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆17Updated 8 months ago
Respaired / RiFornet_Vocoder
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆19Updated 5 months ago
PanagiotisP / svs-multiband
Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022
☆15Updated 3 years ago
rishikksh20 / NU-Wave2-pytorch
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆24Updated 3 years ago
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆34Updated last year
ex3ndr / supervoice-vocoder
Production-ready vocoder using BigVSAN
☆11Updated last year
MaxMax2016 / StreamingHiFiGAN
An Open-source Streaming High-fidelity Neural Audio Codec
☆11Updated last year
iamanigeeit / present
☆13Updated 10 months ago