tonnetonne814 / unofficial-vits2-44100-Ja
44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。
☆22Updated last year
Related projects ⓘ
Alternatives and complementary repositories for unofficial-vits2-44100-Ja
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆48Updated last year
- Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus☆21Updated 4 months ago
- Implementation of vocoders empowered with pytorch lightning☆13Updated 9 months ago
- ☆38Updated 2 months ago
- ☆27Updated 11 months ago
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆57Updated 3 weeks ago
- VITS2 using Phoneme-Level Japanese BERT☆13Updated 10 months ago
- ☆39Updated last year
- ☆25Updated 3 months ago
- ☆22Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆70Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 5 months ago
- ☆31Updated last year
- Source code of APNet2, a vocoder☆51Updated 11 months ago
- ☆16Updated 6 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆54Updated 7 months ago
- 4G GPU & 10 Minutes for train☆12Updated last year
- Incorporating AutoVocoder to MB-iSTFT-VITS☆44Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET)☆31Updated this week
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Updated last year
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆15Updated last year
- A repository of Japanese Phoneme-Level BERT☆20Updated 10 months ago
- VITS with phoneme-level prosody modeling based on MaskGIT☆75Updated 2 months ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆28Updated 2 years ago
- 44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。☆14Updated last year
- 日本語音声に対して音素ラベルをアラインメントするためのツールです☆12Updated 3 months ago
- ☆44Updated last year
- ☆24Updated 4 months ago