deepaudio / deepaudio-ttsLinks

☆12

Alternatives and similar repositories for deepaudio-tts

Users that are interested in deepaudio-tts are comparing it to the libraries listed below

Sorting:

ex3ndr / supervoice-hybrid
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Updated 11 months ago
AkshathRaghav / tinyspeech
Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"
☆19Updated last month
ShoukanLabs / VoPho
A collection of all our phonemeizers for dataset construction and inference
☆24Updated 4 months ago
reppy4620 / vocoders
My vocoder experiments
☆30Updated last week
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Updated 3 years ago
uthree / ddsp-vocoder
☆10Updated 8 months ago
KdaiP / conformer-RoPE
Conformer block with Rotary Position Embedding, modified from lucidrains' implement
☆14Updated 10 months ago
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆17Updated 8 months ago
iamanigeeit / present
☆13Updated 10 months ago
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Updated last year
pengzhendong / audio-pipeline
☆22Updated 9 months ago
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆21Updated last week
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
Incorporating AutoVocoder to MB-iSTFT-VITS
☆48Updated 2 years ago
amphionspace / tts-evaluation
An evaluation set for large-scale trained TTS models (Coming in Sep 2024)
☆12Updated 10 months ago
ORI-Muchim / Grad-TTS
'Grad-TTS' with Multilingual Cleaners
☆10Updated last year
ex3ndr / supervoice-gpt-facodec
GPT for FACodec
☆13Updated last year
reppy4620 / x-vits
☆13Updated 8 months ago
OlaWod / PitchVC
PitchVC: Pitch Conditioned Any-to-Many Voice Conversion
☆33Updated last year
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆34Updated last year
MiniXC / LightningFastSpeech2
☆56Updated 2 years ago
Tikai7 / DiTTO-TTS
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆28Updated 5 months ago
Scarfmonster / HiFiPLN
Multispeaker Community Vocoder Model for DiffSinger
☆37Updated 2 months ago
Infinity-INF / fast-phasr
Phonemes and durations labeling based on whisper small
☆11Updated last year
Plachtaa / ASTRAL-quantization
speaker-disentangled speech linguistic content quantizer
☆21Updated 3 months ago
p1an-lin-jung / wv_tts
☆19Updated last year
p0p4k / vits3_pytorch
☆29Updated last year
p1an-lin-jung / WavThruVec_pytorch
An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"
☆28Updated last year
ex3ndr / supervoice-vocoder
Production-ready vocoder using BigVSAN
☆11Updated last year
MaxMax2016 / StreamingHiFiGAN
An Open-source Streaming High-fidelity Neural Audio Codec
☆11Updated last year
ajaybati / miipher2.0
Reimplementation of Miipher
☆22Updated last year