NaruseMioShirakana / Pits-Japanese-Onnx
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆17Updated 2 years ago
Alternatives and similar repositories for Pits-Japanese-Onnx:
Users that are interested in Pits-Japanese-Onnx are comparing it to the libraries listed below
- A Japanese G2P tool based on pyopenjtalk☆25Updated 2 years ago
- An Implementation of Singing Voice Conversion Based on Diffsinger☆70Updated 2 years ago
- 数据集自动化制作脚本☆71Updated 2 years ago
- Sovits5 with RMVPE☆14Updated last year
- 基于vits fastspeech2 visinger的tts模型☆23Updated 2 years ago
- so-vits-svc rewritten in jax and flax.Updated 8 months ago
- ☆24Updated 2 years ago
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆120Updated last month
- PITS-中日英韩☆12Updated 2 years ago
- Pipelines and tools to build your own DiffSinger dataset.☆103Updated 3 weeks ago
- ☆13Updated last month
- Acoustic models for SVS/SVC/TTS☆31Updated 8 months ago
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Updated 2 years ago
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference☆21Updated 10 months ago
- ☆137Updated 2 months ago
- singing voice conversion based on glow-tts☆11Updated last year
- Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)☆160Updated last year
- ☆39Updated 7 months ago
- 基于FreeVC的歌声转换☆21Updated 2 years ago
- WutheringWaves Datasets For SVC/SVS/TTS☆17Updated this week
- Chinese-Japanese Bilingual Text-to-Speech☆31Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆276Updated last year
- 44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。☆23Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆40Updated last month
- ☆17Updated 5 months ago
- ☆22Updated 2 years ago
- BigVGAN with Neural Source-Filter☆54Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT☆81Updated 7 months ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆48Updated 2 years ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆29Updated 2 years ago