Hecate2 / sukasuka-vocal-dataset-builderLinks
すかすかアニメボカロデータセット。1st anime vocal dataset. Extract audio (vocal) files from video based on .ass subtitle files; manually label vocal files to characters. Will be used for PITS/VITS/Diffusion text-to-speech/SVC. 根据字幕,从视频里抽取全部语音,然后手动按角色标注。
☆48Updated last year
Alternatives and similar repositories for sukasuka-vocal-dataset-builder
Users that are interested in sukasuka-vocal-dataset-builder are comparing it to the libraries listed below
Sorting:
- ACG Text-to-Speech☆174Updated 3 years ago
- Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)☆162Updated 2 years ago
- 一个集成了各种有趣和实用AI项目的工具箱☆40Updated 2 years ago
- async http process VST plugin☆160Updated 2 years ago
- An auxiliary tool for manual screening of audio dataset.☆130Updated 2 years ago
- Chinese-Japanese Bilingual Text-to-Speech☆31Updated 3 years ago
- Deep-learning-based voice changer, supporting local inference.☆99Updated 3 years ago
- MoeGoe Android Application by calling Azure function API☆58Updated 3 years ago
- voistock站点voicelist页面免费音源检索并下载程序(可在线体验)☆22Updated last year
- waifu_diffusion tags and it's translation☆43Updated 2 years ago
- Tacotron2 implementation of Japanese☆269Updated 3 years ago
- SoVits Gradio(Web UI)☆26Updated 2 years ago
- An unofficial implementation of the combination of Soft-VC and VITS☆458Updated 3 years ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆47Updated 2 years ago
- 一个使用OpenAI接口链接VITS模型的语音对话系统GUI☆104Updated 2 years ago
- vits Android部署☆342Updated last year
- VitsWebUi☆33Updated 2 years ago
- vue.js 的 Novel AI leak 前端,简易简洁简陋☆61Updated 3 years ago
- GUI for MoeGoe☆568Updated 2 years ago
- A convenient tool for generating audio files☆134Updated 2 years ago
- An automatic music transcription application☆78Updated 2 years ago
- MoeGoe Azure Cloud Function API☆53Updated 2 years ago
- ☆33Updated last year
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆166Updated 2 years ago
- ☆84Updated last year
- High-quality and controllable charting AI for rhythm games, modifed from stable diffusion☆270Updated last year
- ☆283Updated last year
- PJSK-Vits GUI☆100Updated 3 months ago
- Fine-Tuning your VITS model using a pre-trained model☆549Updated 2 years ago
- Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc☆994Updated 2 years ago