amphionspace / Awesome-Zero-Shot-TTS-PapersView external linksLinks
☆10Sep 2, 2024Updated last year
Alternatives and similar repositories for Awesome-Zero-Shot-TTS-Papers
Users that are interested in Awesome-Zero-Shot-TTS-Papers are comparing it to the libraries listed below
Sorting:
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Sep 2, 2024Updated last year
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- ☆24Feb 28, 2023Updated 2 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- ☆25Mar 6, 2024Updated last year
- ☆23Dec 6, 2025Updated 2 months ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆17Feb 1, 2026Updated 2 weeks ago
- Openfst mirror with some fixes☆14Aug 23, 2024Updated last year
- Using OpenVINO to speed up MeloTTS inference☆15Nov 1, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- ☆11Mar 22, 2023Updated 2 years ago
- ☆15Nov 10, 2025Updated 3 months ago
- Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"☆14Nov 5, 2024Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 10 months ago
- DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently☆11Jun 6, 2024Updated last year
- Tracking beer/wine using Audio Event Detection with Machine Learning☆15Jun 16, 2024Updated last year
- Multispeaker Community Vocoder Model for DiffSinger☆39Aug 11, 2025Updated 6 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- ☆17Mar 30, 2023Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming☆19Oct 12, 2023Updated 2 years ago
- 端到端语音识别实现;包含LAS、CTC、RNNT解码方式,模型SA(MHA)、LSTM、CNN、DFSMN等☆15Jun 4, 2021Updated 4 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆100May 24, 2023Updated 2 years ago
- A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆48Mar 19, 2025Updated 10 months ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- ☆18Aug 23, 2024Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Apr 6, 2025Updated 10 months ago
- ☆29Nov 4, 2025Updated 3 months ago
- ☆12Jun 10, 2021Updated 4 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆32Jul 9, 2024Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.☆22Jan 22, 2024Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year