TouchSky-Lab / Awesome-Text-to-Speech-TTS
Awesome TTS
☆59 · Updated 3 years ago
Alternatives and similar repositories for Awesome-Text-to-Speech-TTS
Users who are interested in Awesome-Text-to-Speech-TTS compare it to the repositories listed below.
- Putting flows on top of neural transducers for better TTS · ☆62 · Updated last week
- flow mirror models from JZX AI Labs · ☆45 · Updated 8 months ago
- Official release of StyleTalk dataset. · ☆64 · Updated 11 months ago
- ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis · ☆138 · Updated 8 months ago
- Implementation of Google's USM speech model in Pytorch · ☆31 · Updated 2 months ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr… · ☆18 · Updated last year
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" (ICLR 2024) · ☆145 · Updated last year
- ☆40 · Updated 3 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion · ☆91 · Updated last year
- "SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts" · ☆73 · Updated last year
- An unofficial PyTorch implementation of VALL-E · ☆87 · Updated last week
- Official Code for ParrotTTS · ☆51 · Updated 7 months ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems. · ☆39 · Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions · ☆51 · Updated 4 years ago
- ☆65 · Updated last year
- Audio tokenization, in the fastest way possible! · ☆52 · Updated 9 months ago
- Huawei Grad-TTS for Chinese · ☆50 · Updated last year
- one script for xls-r/xlsr/whisper fine-tuning · ☆41 · Updated last year
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems · ☆82 · Updated last year
- Monotonic Alignment Search · ☆92 · Updated 2 years ago
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud. · ☆35 · Updated 8 months ago
- GPT-style network for phonemization with durations of text · ☆66 · Updated last year
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization · ☆87 · Updated 2 weeks ago
- A TTS model that makes a speaker speak new languages · ☆76 · Updated 11 months ago
- ☆30 · Updated 2 months ago
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLM · ☆104 · Updated this week
- ☆57 · Updated 11 months ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) · ☆119 · Updated 2 years ago
- The open source code for LLM-Codec · ☆134 · Updated 9 months ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement · ☆157 · Updated last week