Text to Speech for Japanese
☆16May 11, 2023Updated 3 years ago
Alternatives and similar repositories for vits-japanese
Users that are interested in vits-japanese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Non Parallel Voice Conversion based on VITS☆24Mar 31, 2023Updated 3 years ago
- ☆14Aug 1, 2025Updated 10 months ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 10 months ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 基于ESP32的WiFi无线麦克风接收端☆18Dec 2, 2021Updated 4 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Apr 23, 2024Updated 2 years ago
- Just a song guessing game ;)☆15Dec 29, 2025Updated 5 months ago
- python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn☆15Aug 13, 2024Updated last year
- text to speech☆10Mar 19, 2024Updated 2 years ago
- ☆17Jun 3, 2020Updated 6 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 3 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆108Jan 17, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 7 months ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆60Apr 4, 2024Updated 2 years ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆97Jul 4, 2024Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- ☆25Nov 25, 2025Updated 6 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)☆162May 7, 2023Updated 3 years ago
- Direct Preference Optimization Implementation☆17Feb 1, 2024Updated 2 years ago
- Unofficial implementation of wavenext vocoder☆59Aug 28, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆74Aug 21, 2023Updated 2 years ago
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- ☆21Jun 16, 2021Updated 4 years ago
- ☆11Feb 20, 2025Updated last year
- Huawei Grad-TTS for Chinese☆51Sep 26, 2023Updated 2 years ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆112Jun 6, 2022Updated 4 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch☆278Oct 30, 2023Updated 2 years ago
- plugin manager for OpenVoiceOS , STT/TTS/Wakewords that can be used anywhere☆14Updated this week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- AI Hand Controller uses Computer Vision to recognize hand gestures and control various functions on your computer. The application can co…☆23Apr 7, 2025Updated last year
- craftymetaverse.com Front-End Source Code☆16Mar 25, 2022Updated 4 years ago
- ☆24Updated this week
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆16Jul 19, 2023Updated 2 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated last year
- ☆49May 9, 2023Updated 3 years ago
- WutheringWaves Datasets For SVC/SVS/TTS☆39Jul 27, 2025Updated 10 months ago