innnky / audio-preprocessing-scripts
Scripts for automated dataset creation
☆71Updated 2 years ago
Alternatives and similar repositories for audio-preprocessing-scripts
Users that are interested in audio-preprocessing-scripts are comparing it to the libraries listed below
- A TTS model based on VITS, FastSpeech2, and VISinger☆24Updated 2 years ago
- An Implementation of Singing Voice Conversion Based on Diffsinger☆70Updated 2 years ago
- ☆24Updated 2 years ago
- Pipelines and tools to build your own DiffSinger dataset.☆117Updated 6 months ago
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆132Updated 6 months ago
- 🌻 VITS ONNX TTS server designed for fast inference 🔥☆128Updated 8 months ago
- Sovits5 with RMVPE☆14Updated 2 years ago
- VITS Japanese with Whisper as data processor (you can train your VITS even if you only have audio)☆162Updated 2 years ago
- ONNX inference version of Bert-VITS2☆43Updated last year
- ☆148Updated 7 months ago
- RTVC: Real-Time Voice Conversion GUI☆56Updated 2 years ago
- A PyTorch-based Chinese TTS model built on VITS-BigVGAN, with an added prosody prediction model.☆196Updated 3 years ago
- A Japanese G2P tool based on pyopenjtalk☆25Updated 3 years ago
- ☆218Updated 2 years ago
- ☆18Updated 2 months ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆280Updated 2 years ago
- DiffSinger dataset processing tools, including audio processing, labeling.☆59Updated this week
- vits chinese, tts chinese, tts mandarin: described by its author as the easiest-to-train, best-sounding speech synthesis system to date☆218Updated 3 months ago
- ☆123Updated last month
- Acoustic models for SVS/SVC/TTS☆31Updated last year
- Grapheme-to-Phoneme lexicons for Chinese dialects☆69Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆17Updated 2 years ago
- ☆49Updated 2 years ago
- Singing voice conversion based on FreeVC☆21Updated 2 years ago
- ☆79Updated 2 years ago
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆48Updated 6 months ago
- A visual tool for quickly building speech datasets☆195Updated last year
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆48Updated 2 years ago
- SOFA: Singing-Oriented Forced Aligner☆175Updated 4 months ago
- ☆80Updated 2 years ago