yihuitang/StyleTTS_Mandarin

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yihuitang/StyleTTS_Mandarin)

yihuitang / StyleTTS_Mandarin

Implementation of StyleTTS for Mandarin

☆11

Alternatives and similar repositories for StyleTTS_Mandarin

Users that are interested in StyleTTS_Mandarin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

liuhuang31 / HiFTNet-sr
View on GitHub
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24Jan 2, 2024Updated 2 years ago
liuhuang31 / g2pw_once
View on GitHub
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Dec 30, 2023Updated 2 years ago
nivibilla / efficient-vits-finetuning
View on GitHub
Finetuning VITS Efficiently
☆32Nov 6, 2023Updated 2 years ago
yl4579 / AuxiliaryASR
View on GitHub
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
☆127Jun 16, 2022Updated 4 years ago
jiangqizheng / art
View on GitHub
基于serverless实现的《图片艺术化应用》
☆10Sep 8, 2020Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
uthree / fastersvc
View on GitHub
☆26Mar 20, 2024Updated 2 years ago
yl4579 / PL-BERT
View on GitHub
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆270Jan 13, 2025Updated last year
chorusai / arpa2ipa
View on GitHub
A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)
☆17Jan 2, 2018Updated 8 years ago
shivammehta25 / BetterFastSpeech2
View on GitHub
Just another FastSpeech 2 but cleaner code :)
☆29Jun 28, 2024Updated 2 years ago
tonnetonne814 / MB-iSTFT-VITS-44100-Ja
View on GitHub
44100Hz日本語音源に対応した MB-iSTFT-VITS: Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Tim…
☆39Jun 2, 2023Updated 3 years ago
ruqqq / blockchainparser
View on GitHub
Go library for parsing raw bitcoin block files.
☆10Nov 1, 2017Updated 8 years ago
v3ucn / Bert-vits2-Extra-Stream-webui-api
View on GitHub
基于Bert-vits2-Extra项目添加的流式推理和流式接口api功能
☆16Apr 12, 2024Updated 2 years ago
Zz-ww / VITS-BigVGAN-SpanPSP-Chinese
View on GitHub
基于PyTorch的VITS-BigVGAN的tts中文模型，加入韵律预测模型。
☆198Sep 15, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
pkufool / cppinyin
View on GitHub
Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy.
☆23Jan 5, 2026Updated 6 months ago
smart-audio / audio_diarization_annotation
View on GitHub
Audio Diarization Annotation tool
☆30Nov 8, 2019Updated 6 years ago
tonnetonne814 / unofficial-vits2-44100-Ja
View on GitHub
44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。
☆24Sep 1, 2023Updated 2 years ago
adelacvg / ttts
View on GitHub
Train the next generation of TTS systems.
☆169Sep 13, 2024Updated last year
OlaWod / PitchVC
View on GitHub
PitchVC: Pitch Conditioned Any-to-Many Voice Conversion
☆35Jun 6, 2024Updated 2 years ago
KentoNishi / torch-time-stretch
View on GitHub
Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included…
☆40Sep 5, 2022Updated 3 years ago
JvanKatwijk / sdr-j-sw
View on GitHub
shortwave reception software
☆14Jul 17, 2018Updated 8 years ago
jishengpeng / ControlSpeech
View on GitHub
[ACL 2025 Main] ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec
☆276Nov 22, 2024Updated last year
flamed-tts / Flamed-TTS
View on GitHub
This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …
☆57Aug 9, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mcf330 / efts2code
View on GitHub
source code of EfficientTTS 2
☆21Feb 18, 2024Updated 2 years ago
shanggangli / tianchi-Sina-weibo-interactive-prediction
View on GitHub
天池项目：新浪微博互动预测
☆10Apr 25, 2020Updated 6 years ago
ssbuild / t5_finetuning
View on GitHub
clue chatyuan finetuning
☆17Mar 10, 2025Updated last year
b04901014 / UUVC
View on GitHub
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆83Jan 7, 2023Updated 3 years ago
feliksh / SCDS
View on GitHub
Code for "Speaker Clustering using Dominant Sets", ICPR 2018
☆11Nov 28, 2020Updated 5 years ago
Dahan-Wang / Rethinking-Flow-and-Diffusion-Bridge-Models-for-Speech-Enhancement
View on GitHub
☆39Feb 23, 2026Updated 5 months ago
chenllliang / Genshin-Impact-NPC-Audio-Texts-Dataset
View on GitHub
收集了原神所有角色的中文语音文本内容
☆15Jun 16, 2021Updated 5 years ago
Daisyqk / Automatic-Prosody-Annotation
View on GitHub
☆112Mar 9, 2026Updated 4 months ago
tomasJwYU / AutoPrepDemo
View on GitHub
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
☆36Dec 31, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
techiaith / docker-huggingface-stt-cy
View on GitHub
Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace
☆13Nov 29, 2022Updated 3 years ago
IS2AI / KazEmoTTS
View on GitHub
An open-source Kazakh Emotional Text-to-Speech Dataset
☆36Aug 1, 2025Updated 11 months ago
hayeong0 / DDDM-VC
View on GitHub
Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for V…
☆244Jul 31, 2024Updated last year
yanghaha0908 / FastHuBERT
View on GitHub
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆100Nov 20, 2024Updated last year
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
View on GitHub
Incorporating AutoVocoder to MB-iSTFT-VITS
☆47Dec 1, 2022Updated 3 years ago
Yifei-ZHAO96 / STAM-pytorch
View on GitHub
Pytorch implementation of "spectro-temporal attention-based voice activity detection"
☆13Jun 4, 2024Updated 2 years ago
lmine / img2sound
View on GitHub
Convert image in sound
☆10Dec 28, 2012Updated 13 years ago