amphionspace / Awesome-Zero-Shot-TTS-Papers

☆10

Alternatives and similar repositories for Awesome-Zero-Shot-TTS-Papers

Users who are interested in Awesome-Zero-Shot-TTS-Papers are comparing it to the repositories listed below

kimsunwiub / BLOOM-Net
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
☆14 · Feb 13, 2022 · Updated 4 years ago
amphionspace / tts-evaluation
An evaluation set for large-scale trained TTS models (Coming in Sep 2024)
☆12 · Sep 2, 2024 · Updated last year
liuhuang31 / Megatts2_HierSpeechpp
Megatts2 use HierSpeechpp's vocoder
☆18 · Dec 2, 2024 · Updated last year
JonathanDZ / TF-FaSNet
☆24 · Feb 28, 2023 · Updated 2 years ago
aispeech-lab / TinyWASE
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11 · Jun 28, 2021 · Updated 4 years ago
choiHkk / Transformer-TTS-V2
☆25 · Mar 6, 2024 · Updated last year
yxduir / LLM-SRT
☆23 · Dec 6, 2025 · Updated 2 months ago
Chengyuann / AutoStyle-TTS
Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…
☆17 · Feb 1, 2026 · Updated 2 weeks ago
alphacep / openfst
Openfst mirror with some fixes
☆14 · Aug 23, 2024 · Updated last year
zhaohb / MeloTTS-OV
Using OpenVINO to speed up MeloTTS inference
☆15 · Nov 1, 2024 · Updated last year
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆16 · Dec 3, 2024 · Updated last year
prairie-schooner / wav2vec-vc
☆11 · Mar 22, 2023 · Updated 2 years ago
lourson1091 / audiobertscore
☆15 · Nov 10, 2025 · Updated 3 months ago
jh-cha-prml / JELLY
Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"
☆14 · Nov 5, 2024 · Updated last year
5Hyeons / StyleTTS2-Vocos
StyleTTS2 + Vocos as a Decoder
☆13 · Mar 24, 2025 · Updated 10 months ago
ictnlp / DST
DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently
☆11 · Jun 6, 2024 · Updated last year
jonnor / brewing-audio-event-detection
Tracking beer/wine using Audio Event Detection with Machine Learning
☆15 · Jun 16, 2024 · Updated last year
Scarfmonster / HiFiPLN
Multispeaker Community Vocoder Model for DiffSinger
☆39 · Aug 11, 2025 · Updated 6 months ago
AkshathRaghav / tinyspeech
Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"
☆21 · Jun 7, 2025 · Updated 8 months ago
sp-uhh / uncertainty-SE
☆17 · Mar 30, 2023 · Updated 2 years ago
iamanigeeit / present
☆14 · Aug 19, 2024 · Updated last year
lars76 / fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18 · Aug 16, 2024 · Updated last year
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆14 · Mar 13, 2024 · Updated last year
dodohow1011 / SpeechAdvReprogram
A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming
☆19 · Oct 12, 2023 · Updated 2 years ago
whull / end2end_ASR
End-to-end speech recognition implementation; includes LAS, CTC, and RNNT decoding methods, with SA (MHA), LSTM, CNN, DFSMN, and other models
☆15 · Jun 4, 2021 · Updated 4 years ago
RookieJunChen / Inter-SubNet
The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.
☆100 · May 24, 2023 · Updated 2 years ago
flageval-baai / ChildMandarin
A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5
☆48 · Mar 19, 2025 · Updated 10 months ago
mushanshanshan / ESLTTS
ESLTTS dataset
☆16 · Feb 6, 2025 · Updated last year
hbwu-ntu / EmoCtrlTTS-Eval
☆18 · Aug 23, 2024 · Updated last year
fakerybakery / OpenF5-TTS
(WIP) A retrain of F5-TTS on permissively-licensed data
☆13 · Apr 6, 2025 · Updated 10 months ago
Anuttacon / speech_drame
☆29 · Nov 4, 2025 · Updated 3 months ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
☆12 · Jun 10, 2021 · Updated 4 years ago
echocatzh / conv-stft
A STFT/iSTFT written up in PyTorch using 1D Convolutions
☆32 · Jul 9, 2024 · Updated last year
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18 · Oct 20, 2024 · Updated last year
facebookresearch / llama-hd-dataset
This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.
☆22 · Jan 22, 2024 · Updated 2 years ago
DDATT / Vits2-onnx-cpp
Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++
☆18 · Apr 17, 2024 · Updated last year
Jackiexiao / tts-frontend-dataset
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
☆103 · Feb 5, 2024 · Updated 2 years ago
adelacvg / detail_tts
All generative model in one for better TTS model
☆74 · Sep 8, 2024 · Updated last year
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18 · May 17, 2024 · Updated last year
