ditto-tts / ditto-tts.github.io

Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

☆38

Alternatives and similar repositories for ditto-tts.github.io

Users who are interested in ditto-tts.github.io are comparing it to the repositories listed below.

keonlee9420 / evaluate-zero-shot-tts
Evaluation Protocol for Large-Scale Zero-Shot TTS Literature
☆93 Updated 10 months ago
neosapience / editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
☆121 Updated 3 years ago
hs-oh-prml / DurFlexEVC
☆82 Updated last year
justinlovelace / SESD
☆61 Updated last year
adelacvg / detail_tts
All generative model in one for better TTS model
☆74 Updated last year
cantabile-kwok / vec2wav2.0
Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995
☆78 Updated last year
liuhuadai / ViT-TTS
PyTorch Implementation of ViT-TTS (EMNLP'23)
☆11 Updated 2 years ago
lakahaga / dc-comix-tts
Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer
☆75 Updated 2 years ago
Edresson / ZS-TTS-Evaluation
☆44 Updated last year
asappresearch / simple-tts
Contains the code associated with the ICLR submission for our text-to-speech diffusion model
☆57 Updated 2 years ago
hcy71o / AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71 Updated 3 years ago
AlanBaade / SyllableLM
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆59 Updated 7 months ago
bfs18 / e2_tts
☆70 Updated last year
Aria-K-Alethia / laughter-synthesis
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77 Updated 2 years ago
XiangLi2022 / CM-TTS
[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a…
☆69 Updated last year
hs-oh-prml / DiffProsody
☆68 Updated 2 years ago
line / promptttspp
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
☆83 Updated last year
Takaaki-Saeki / zm-text-tts
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆64 Updated 2 years ago
monglechap / fluenttts
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS
☆20 Updated 3 years ago
revsic / torch-whisper-guided-vc
Torch implementation of Whisper-guided DDPM based Voice Conversion
☆49 Updated 2 years ago
AI-S2-Lab / FluentEditor
[InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
☆59 Updated last year
keonlee9420 / Robust_Fine_Grained_Prosody_Control
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41 Updated 3 years ago
BakerBunker / FreeV
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
☆93 Updated last year
supertone-inc / super-monotonic-align
☆167 Updated last year
revsic / torch-nansy
Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513
☆64 Updated 2 years ago
jzmzhong / Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
☆51 Updated last year
PlayVoice / BigVGAN
BigVGAN with Neural Source-Filter
☆56 Updated 2 years ago
3loi / NaturalVoices
☆59 Updated 3 months ago
yangdongchao / ALMTokenizer2
The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…
☆42 Updated 5 months ago
yangdongchao / SimpleSpeech
The open source code for SimpleSpeech series
☆145 Updated last year