wespeech / awesome-ttsLinks
☆21Updated 3 years ago
Alternatives and similar repositories for awesome-tts
Users that are interested in awesome-tts are comparing it to the libraries listed below
Sorting:
- Audio-FLAN☆153Updated 2 months ago
- ☆92Updated 6 months ago
- The official source code of UniAudio☆93Updated last year
- ☆71Updated last year
- ☆69Updated 4 years ago
- Curated list for papers, codes and resources related to Text-to-Audio (TTA) Generation☆28Updated this week
- ☆65Updated last year
- Robust Singing Voice Transcription and MIDI Extraction☆77Updated 6 months ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆50Updated last year
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆65Updated last month
- Huawei Grad-TTS for Chinese☆50Updated last year
- ☆56Updated last year
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆67Updated 10 months ago
- Xmart青年论坛仓库,存放历史学生论坛和前沿讲座的视频回放和讲义,获取最新Xmart预告欢迎关注公众号【XLANCE Lab】☆19Updated last month
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆53Updated 2 years ago
- ☆75Updated 3 years ago
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆30Updated 5 months ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆68Updated last year
- ☆57Updated 2 years ago
- ☆40Updated 9 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆76Updated 6 months ago
- ☆54Updated 7 months ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆78Updated 6 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆140Updated last year
- Survey on speech generation work.☆19Updated last year
- Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models☆164Updated last week
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.☆133Updated last month
- Music generation☆24Updated last year
- ☆20Updated last month