zixiiu / vits
VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai
☆24Updated last year
Related projects ⓘ
Alternatives and complementary repositories for vits
- 基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。☆194Updated 2 years ago
- vits chinese, tts chinese, tts mandarin 史上训练最简单,音质最好的语音合成系统☆211Updated 3 years ago
- application of vits on mandarin tts☆119Updated last year
- vits2 backbone with bert☆335Updated 7 months ago
- 语音数据集制作标记工具☆131Updated 2 years ago
- ☆259Updated 6 months ago
- Simple data labeling script with funasr inside. 使用阿里fanasr进行VITS训练数据标注☆77Updated last year
- 一个快速制作语音数据集的可视化工具☆193Updated 8 months ago
- GPT-SoVITS2☆184Updated 3 months ago
- ☆418Updated this week
- VITS for Mandarin. Support Windows and Linux, low-end and high-end hardwares☆112Updated last year
- VC Without Retrain!☆104Updated 6 months ago
- 语音合成项目☆165Updated last year
- A cli tool for split vocal timbre.☆187Updated 2 weeks ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆166Updated last year
- How to use our public wav2vec2 dimensional emotion model☆457Updated last year
- Genshin Datasets For SVC/SVS/TTS☆600Updated last month
- 基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏☆249Updated last year
- ☆193Updated last year
- Preprocess Audio for training☆262Updated last month
- vits2 backbone with bert☆84Updated 10 months ago
- SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.☆192Updated 9 months ago
- 无需情感标注的情感可控语音合成模型,基于VITS☆1,333Updated last year
- ☆48Updated last year
- 🌻 VITS ONNX TTS server designed for fast inference 🔥☆121Updated last year
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai☆917Updated 11 months ago
- Documentation for Bert-VITS2☆22Updated 11 months ago
- Split audio using the .srt file, clean up annotations, then merge and package into a format suitable for bert-vits2 in a standard manner.…☆44Updated 5 months ago
- Fine-Tuning your VITS model using a pre-trained model☆551Updated last year
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆422Updated 2 years ago