glory20h / VoiceLDM
VoiceLDM: Text-to-Speech with Environmental Context
☆175Updated 8 months ago
Alternatives and similar repositories for VoiceLDM:
Users that are interested in VoiceLDM are comparing it to the libraries listed below
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆158Updated 7 months ago
- The open source code for SimpleSpeech series☆138Updated 6 months ago
- Official Implementation of EnCLAP (ICASSP 2024)☆91Updated 11 months ago
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆175Updated 9 months ago
- Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"☆190Updated last year
- An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.☆144Updated last month
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆142Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆138Updated 6 months ago
- Reference-aware automatic speech evaluation toolkit☆153Updated 5 months ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆130Updated 4 months ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS