innnky / pitsLinks
PITS-中日英韩
☆12Updated 2 years ago
Alternatives and similar repositories for pits
Users that are interested in pits are comparing it to the libraries listed below
Sorting:
- Pipelines and tools to build your own DiffSinger dataset.☆118Updated 6 months ago
- Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)☆162Updated 2 years ago
- ☆24Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆17Updated 2 years ago
- 数据集自动化制作脚本☆72Updated 2 years ago
- Deep-learning-based voice changer, supporting local inference.☆99Updated 2 years ago
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆135Updated 7 months ago
- Hubert-based Forced Aligner☆20Updated last month
- DiffSinger dataset processing tools, including audio processing, labeling.☆61Updated 3 weeks ago
- A Japanese G2P tool based on pyopenjtalk☆25Updated 3 years ago
- ☆150Updated 8 months ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆280Updated 2 years ago
- 基于vits fastspeech2 visinger的tts模型☆24Updated 2 years ago
- Chinese-Japanese Bilingual Text-to-Speech☆31Updated 3 years ago
- ☆19Updated 3 months ago
- Code for ICASSP2022 paper "Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription …☆151Updated 3 years ago
- WutheringWaves Datasets For SVC/SVS/TTS☆23Updated 3 months ago
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference☆24Updated last year
- Python script to convert NNSVS DBs to Diffsinger without the NNSVS Python Library☆26Updated 2 months ago
- application of vits on mandarin tts☆120Updated 2 years ago
- Acoustic models for SVS/SVC/TTS☆31Updated last year
- SOFA: Singing-Oriented Forced Aligner☆177Updated 5 months ago
- ☆49Updated 2 years ago
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆48Updated 7 months ago
- ☆71Updated 2 years ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆47Updated 2 years ago
- [RecurrentNN × Regression × Regularized]-base Mouth Opening Estimation via SSL(Semi-supervised Learning).☆21Updated 3 months ago
- ☆172Updated 2 weeks ago
- StarRail Datasets For SVC/SVS/TTS☆333Updated 3 months ago
- Interlingual user dictionary of Synthesizer V. Synthesizer V的跨语言用户词典。☆62Updated 3 months ago