Hecate2 / sukasuka-vocal-dataset-builder
SukaSuka anime vocal dataset, the 1st anime vocal dataset. Extracts audio (vocal) files from video based on .ass subtitle files; the vocal files are then manually labeled by character. Intended for use with PITS/VITS/Diffusion text-to-speech/SVC.
☆48Updated last year
Alternatives and similar repositories for sukasuka-vocal-dataset-builder
Users who are interested in sukasuka-vocal-dataset-builder are comparing it to the repositories listed below
- ACG Text-to-Speech☆175Updated 2 years ago
- VITS Japanese with Whisper as data processor (you can train your VITS model even if you only have audio)☆162Updated 2 years ago
- async http process VST plugin☆160Updated 2 years ago
- SoVits Gradio(Web UI)☆26Updated 2 years ago
- MoeGoe Android Application by calling Azure function API☆58Updated 3 years ago
- Deep-learning-based voice changer, supporting local inference.☆98Updated 2 years ago
- Tacotron2 implementation of Japanese☆269Updated 3 years ago
- VitsWebUi☆33Updated 2 years ago
- An auxiliary tool for manual screening of audio dataset.☆130Updated 2 years ago
- Acoustic models for SVS/SVC/TTS☆31Updated last year
- A toolbox that integrates a variety of fun and practical AI projects☆42Updated 2 years ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆48Updated 2 years ago
- MoeGoe Azure Cloud Function API☆53Updated 2 years ago
- A multi-purpose toolset for DiffSinger☆11Updated 2 years ago
- Find available stable diffusion online☆38Updated 2 years ago
- Speech synthesis model/inference GUI repo for galgame characters based on Tacotron2, HiFi-GAN, VITS and Diff-svc☆990Updated 2 years ago
- ☆25Updated 2 years ago
- textual inversion models I made for novelai☆80Updated 2 years ago
- 🧹 Subtitle removal for game story screen recordings☆34Updated 11 months ago
- A program that searches for and downloads free voice sources from the voicelist page of the voistock site (online demo available)☆22Updated last year
- ☆285Updated last year
- waifu_diffusion tags and their translations☆43Updated 2 years ago
- Removes the "AI look" from images using the Stable Diffusion API☆78Updated last year
- A convenient tool for generating audio files☆135Updated 2 years ago
- GUI for MoeGoe☆569Updated 2 years ago
- ☆84Updated last year
- Chinese-Japanese Bilingual Text-to-Speech☆31Updated 3 years ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆471Updated 2 years ago
- A voice dialogue system GUI that connects the OpenAI API to a VITS model☆104Updated 2 years ago
- DiffSinger community vocoders release page☆289Updated 7 months ago