tonnetonne814/SiFi-VITS2-44100-Ja

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tonnetonne814/SiFi-VITS2-44100-Ja)

tonnetonne814 / SiFi-VITS2-44100-Ja

DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.

☆55

Alternatives and similar repositories for SiFi-VITS2-44100-Ja

Users that are interested in SiFi-VITS2-44100-Ja are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tonnetonne814 / QuickVC-44100-Ja_HuBERT
View on GitHub
44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。
☆16May 21, 2023Updated 3 years ago
tonnetonne814 / PL-Bert-VITS2
View on GitHub
VITS2 using Phoneme-Level Japanese BERT
☆14Dec 17, 2023Updated 2 years ago
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
reppy4620 / x-vits
View on GitHub
☆14Aug 1, 2025Updated 11 months ago
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
redmist328 / APNet2
View on GitHub
Source code of APNet2, a vocoder
☆60Nov 23, 2023Updated 2 years ago
line / promptttspp
View on GitHub
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
☆86Oct 11, 2024Updated last year
uthree / auris_experimental_vits_dsp
View on GitHub
AI based singing voice synthesis
☆37Jun 10, 2024Updated 2 years ago
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
View on GitHub
Incorporating AutoVocoder to MB-iSTFT-VITS
☆47Dec 1, 2022Updated 3 years ago
shinhyeokoh / rwen
View on GitHub
☆14Jun 16, 2023Updated 3 years ago
ORI-Muchim / PolyLangVITS
View on GitHub
Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)
☆75Feb 28, 2024Updated 2 years ago
asuni / PitchSqueezer
View on GitHub
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆38Jan 17, 2024Updated 2 years ago
hcy71o / SC-CNN
View on GitHub
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39Nov 1, 2023Updated 2 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
p0p4k / vits3_pytorch
View on GitHub
☆28Nov 15, 2023Updated 2 years ago
yukara-ikemiya / wavefit-pytorch
View on GitHub
PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.
☆70Jul 13, 2026Updated 2 weeks ago
mush42 / istft-onnx
View on GitHub
Export an ONNX graph that performs ISTFT. Designed for TTS models.
☆28Apr 23, 2024Updated 2 years ago
tsukumijima / pyopenjtalk-plus
View on GitHub
pyopenjtalk-plus: A Python wrapper for OpenJTalk with additional improvements
☆58Jul 18, 2026Updated last week
litagin02 / laughter-collector
View on GitHub
大量の音声データから笑い声部分を集めるやつ
☆14May 23, 2024Updated 2 years ago
pengzhendong / streaming-vocos
View on GitHub
Streaming Vocos
☆31Jun 10, 2025Updated last year
hs-oh-prml / DurFlexEVC
View on GitHub
☆82Jan 22, 2025Updated last year
PlayVoice / Grad-SVC
View on GitHub
Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei
☆173Oct 24, 2023Updated 2 years ago
tarepan / SpeechMOS
View on GitHub
Easy-to-Use Speech MOS predictors
☆362Oct 24, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
innnky / descript-audio-vae
View on GitHub
VAE modified from Descript Audio Codec, which replaces the RVQ with VAE
☆92Apr 2, 2024Updated 2 years ago
ndkgit339 / spe-dss
View on GitHub
Speech Parameter Estimation Using Differentiable Speech Synthesizer
☆43May 9, 2023Updated 3 years ago
litagin02 / vits-japros-webui
View on GitHub
日本語TTS（VITS）の学習と音声合成のGradio WebUI
☆42Jan 5, 2024Updated 2 years ago
PlayVoice / VI-SVC
View on GitHub
VI-SVC model is just VITS without MAS and DurationPredictor.
☆10Nov 9, 2023Updated 2 years ago
lakahaga / dc-comix-tts
View on GitHub
Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer
☆74Aug 21, 2023Updated 2 years ago
tonnetonne814 / unofficial-vits2-44100-Ja
View on GitHub
44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。
☆24Sep 1, 2023Updated 2 years ago
anonymous-pits / pits
View on GitHub
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆280Jul 16, 2023Updated 3 years ago
kyamauchi1023 / PL-BERT-ja
View on GitHub
A repository of Japanese Phoneme-Level BERT
☆24Dec 16, 2023Updated 2 years ago
seastar105 / pflow-encodec
View on GitHub
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
☆77Jul 13, 2026Updated 2 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
zhenye234 / FlashSpeech
View on GitHub
ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis
☆156Sep 20, 2024Updated last year
supertone-inc / super-monotonic-align
View on GitHub
☆173Sep 19, 2024Updated last year
uthree / fastersvc
View on GitHub
☆27Mar 20, 2024Updated 2 years ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
p0p4k / pflowtts_pytorch
View on GitHub
Unofficial implementation of NVIDIA P-Flow TTS paper
☆228Dec 24, 2024Updated last year
innnky / ar-vits
View on GitHub
text to speech using autoregressive transformer and VITS
☆248Apr 3, 2024Updated 2 years ago