Text to Speech for Japanese
☆15May 11, 2023Updated 2 years ago
Alternatives and similar repositories for vits-japanese
Users that are interested in vits-japanese are comparing it to the libraries listed below
Sorting:
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Non Parallel Voice Conversion based on VITS☆24Mar 31, 2023Updated 2 years ago
- ☆14Aug 1, 2025Updated 7 months ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- 基于ESP32的WiFi无线麦克风接收端☆17Dec 2, 2021Updated 4 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆28Apr 23, 2024Updated last year
- Just a song guessing game ;)☆14Dec 29, 2025Updated 2 months ago
- python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn☆14Aug 13, 2024Updated last year
- ☆17Jun 3, 2020Updated 5 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆107Jan 17, 2025Updated last year
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 4 months ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆93Jul 4, 2024Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- ☆22Nov 25, 2025Updated 3 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)☆162May 7, 2023Updated 2 years ago
- Direct Preference Optimization Implementation☆17Feb 1, 2024Updated 2 years ago
- Unofficial implementation of wavenext vocoder☆60Aug 28, 2024Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Aug 21, 2023Updated 2 years ago
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- ☆21Jun 16, 2021Updated 4 years ago
- ☆11Feb 20, 2025Updated last year
- Huawei Grad-TTS for Chinese☆51Sep 26, 2023Updated 2 years ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆113Jun 6, 2022Updated 3 years ago
- Prosody Predict☆10Jan 4, 2021Updated 5 years ago
- A simple implementation for improving CosyVoice2 by GRPO method☆34Oct 17, 2025Updated 5 months ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- AI Hand Controller uses Computer Vision to recognize hand gestures and control various functions on your computer. The application can co…☆21Apr 7, 2025Updated 11 months ago
- plugin manager for OpenVoiceOS , STT/TTS/Wakewords that can be used anywhere☆13Mar 12, 2026Updated last week
- craftymetaverse.com Front-End Source Code☆16Mar 25, 2022Updated 3 years ago
- ☆23Jan 29, 2026Updated last month
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆16Jul 19, 2023Updated 2 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆20Apr 10, 2025Updated 11 months ago