QuyAnh2005/vits-japanese

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/QuyAnh2005/vits-japanese)

QuyAnh2005 / vits-japanese

Text to Speech for Japanese

☆16

Alternatives and similar repositories for vits-japanese

Users that are interested in vits-japanese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
vtuber-plan / vcvits
View on GitHub
Non Parallel Voice Conversion based on VITS
☆24Mar 31, 2023Updated 3 years ago
reppy4620 / x-vits
View on GitHub
☆14Aug 1, 2025Updated 11 months ago
kimsunwiub / BLOOM-Net
View on GitHub
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
☆14Feb 13, 2022Updated 4 years ago
Respaired / RiFornet_Vocoder
View on GitHub
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆25Aug 1, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
BearLaboratory / wifiaudio-rx-hardware
View on GitHub
基于ESP32的WiFi无线麦克风接收端
☆19Dec 2, 2021Updated 4 years ago
AUGMXNT / shisa
View on GitHub
☆43Mar 30, 2024Updated 2 years ago
mush42 / istft-onnx
View on GitHub
Export an ONNX graph that performs ISTFT. Designed for TTS models.
☆28Apr 23, 2024Updated 2 years ago
phineas-pta / speech-synthesis-ngngngan
View on GitHub
python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn
☆15Aug 13, 2024Updated last year
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
AI-Challenge-5th / MWPToolkit
View on GitHub
MWPToolkit is an open-source framework for math word problem(MWP) solvers.
☆28Jan 7, 2022Updated 4 years ago
neoncloud / mdctGAN
View on GitHub
Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"
☆66Jun 3, 2023Updated 3 years ago
winddori2002 / DEX-TTS
View on GitHub
DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability
☆108Jan 17, 2025Updated last year
BakerBunker / FreeV
View on GitHub
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
☆98Jul 4, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
innnky / MagVITS
View on GitHub
VITS with phoneme-level prosody modeling based on MaskGIT
☆85Aug 31, 2024Updated last year
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
View on GitHub
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆60Apr 4, 2024Updated 2 years ago
felixfuyihui / Optimize-FixBF-Weight
View on GitHub
☆17Jun 3, 2020Updated 6 years ago
AlexandaJerry / whisper-vits-japanese
View on GitHub
Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)
☆162May 7, 2023Updated 3 years ago
aome510 / song-guessr
View on GitHub
Just a song guessing game ;)
☆16Jul 5, 2026Updated 2 weeks ago
liuhuadai / ViT-TTS
View on GitHub
PyTorch Implementation of ViT-TTS (EMNLP'23)
☆11Oct 20, 2023Updated 2 years ago
Harry-Yu-Shuhang / Step-Audio-tts
View on GitHub
☆11Feb 20, 2025Updated last year
hs-oh-prml / EmotionControllableTextToSpeech
View on GitHub
☆21Jun 16, 2021Updated 5 years ago
lakahaga / dc-comix-tts
View on GitHub
Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer
☆74Aug 21, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
MaxMax2016 / Grad-TTS-Chinese
View on GitHub
Huawei Grad-TTS for Chinese
☆50Sep 26, 2023Updated 2 years ago
LuxPhoenix / Umamusume
View on GitHub
This is a project aiming at automatically running the game of Umamusume Pretty Derby.
☆18Dec 25, 2025Updated 6 months ago
imdanboy / jets
View on GitHub
JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech
☆112Jun 6, 2022Updated 4 years ago
wetdog / wavenext_pytorch
View on GitHub
Unofficial implementation of wavenext vocoder
☆59Aug 28, 2024Updated last year
xzf-thu / Voices-in-the-Wild-Bench
View on GitHub
☆28May 22, 2026Updated 2 months ago
LePhiAnhDev / magic-hand-ai
View on GitHub
AI Hand Controller uses Computer Vision to recognize hand gestures and control various functions on your computer. The application can co…
☆23Apr 7, 2025Updated last year
OpenVoiceOS / ovos-plugin-manager
View on GitHub
plugin manager for OpenVoiceOS , STT/TTS/Wakewords that can be used anywhere
☆14Updated this week
HappyColor / DrawSpeech_PyTorch
View on GitHub
☆25Nov 25, 2025Updated 7 months ago
wac81 / vits_chinese
View on GitHub
☆49May 9, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
lucidrains / spear-tts-pytorch
View on GitHub
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
☆277Oct 30, 2023Updated 2 years ago
liuhuang31 / g2pw_once
View on GitHub
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Dec 30, 2023Updated 2 years ago
pengzhendong / speaker-diarization
View on GitHub
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆15Dec 23, 2024Updated last year
yl4579 / StyleTTS
View on GitHub
Official Implementation of StyleTTS
☆466Jan 13, 2025Updated last year
yl4579 / SLMGAN
View on GitHub
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs
☆16Jul 19, 2023Updated 3 years ago
MLX15 / craftymetaverse
View on GitHub
craftymetaverse.com Front-End Source Code
☆16Mar 25, 2022Updated 4 years ago
davidschiff100 / accent_conversion_deep_learning
View on GitHub
An open source accent conversion model based on the real time voice cloning repository
☆12May 10, 2024Updated 2 years ago