HeCheng0625 / Amphion

☆10

Related projects

Alternatives and complementary repositories for Amphion

insunhwang89 / StyleVC
☆30 Updated last year
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial PyTorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆58 Updated 7 months ago
lakahaga / dc-comix-tts
Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer
☆76 Updated last year
hcy71o / SNAC
Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…
☆56 Updated last year
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆79 Updated last year
choiHkk / pitch-control-vits
☆31 Updated last year
ogunlao / glowtts_stdp
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆18 Updated last year
adelacvg / diff-vits
☆39 Updated last year
choiHkk / CVAEJETS
Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech
☆46 Updated 2 years ago
Joshua-1995 / LearnableUpsamplingLayer-Pytorch
PyTorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)
☆54 Updated 8 months ago
prml-lab-speech-team / demo
☆25 Updated 3 months ago
Labmem-Zhouyx / CDFSE_FastSpeech2
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆81 Updated last year
seastar105 / pflow-encodec
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
☆67 Updated 6 months ago
hs-oh-prml / DurFlexEVC
☆50 Updated 9 months ago
skysbird / g2p-zh-en
Chinese and English bilingual G2P
☆20 Updated last year
cantabile-kwok / vec2wav2.0
Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995
☆51 Updated last week
0417keito / PromptTTS2
[WIP] Unofficial Implementation of Microsoft's PromptTTS2
☆51 Updated last year
zhengmidon / singaligner
a compact audio-to-phoneme aligner for singing voice
☆10 Updated 10 months ago
line / promptttspp
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
☆61 Updated last month
X-E-Speech / X-E-Speech-code
X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion
☆71 Updated 7 months ago
lifeiteng / SoundStorm
☆70 Updated last year
adelacvg / detail_tts
All generative models in one for a better TTS model
☆66 Updated 2 months ago
PhonemeHallucinator / Phoneme_Hallucinator
☆45 Updated last year
hs-oh-prml / DiffProsody
☆62 Updated last year
hs-oh-prml / EmotionControllableTextToSpeech
☆21 Updated 3 years ago
keonlee9420 / Robust_Fine_Grained_Prosody_Control
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41 Updated 2 years ago
choiHkk / VITSinger
Singing Voice Speech modeling test
☆35 Updated 2 years ago
IS2AI / KazEmoTTS
An open-source Kazakh Emotional Text-to-Speech Dataset
☆26 Updated 7 months ago
p0p4k / vits3_pytorch
☆28 Updated last year