Zz-ww / VITS-BigVGAN-SpanPSP-ChineseLinks
基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。
☆196Updated 3 years ago
Alternatives and similar repositories for VITS-BigVGAN-SpanPSP-Chinese
Users that are interested in VITS-BigVGAN-SpanPSP-Chinese are comparing it to the libraries listed below
Sorting:
- ☆49Updated 2 years ago
- vits chinese, tts chinese, tts mandarin 史上训练最简单,音质最好的语音合成系统☆217Updated 5 months ago
- 基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏☆271Updated 2 years ago
- Unoffical implementation of Megatts2☆287Updated last year
- ☆220Updated 2 years ago
- text to speech using autoregressive transformer and VITS☆247Updated last year
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆280Updated 2 years ago
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆228Updated 2 years ago
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆236Updated last year
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆405Updated this week
- 基于vits fastspeech2 visinger的tts模型☆24Updated 2 years ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion☆253Updated 2 years ago
- Singing Voice Synthesis based on VITS, different from VISinger☆190Updated 2 years ago
- 一个快速制作语音数据集的可视化工具☆194Updated last year
- Forced Alignment-MFA☆48Updated 3 years ago
- Bert-VITS2 onnx推理版本☆43Updated last year
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆464Updated 3 years ago
- HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆389Updated last year
- 基于 g2pW 提升 pypinyin 的准确性☆101Updated 2 years ago
- VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer☆350Updated last year
- application of vits on mandarin tts☆120Updated 2 years ago
- The deme page of InstructTTS☆157Updated last year
- ☆68Updated 2 years ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆69Updated 3 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆194Updated 3 years ago
- Huawei Grad-TTS for Chinese☆49Updated 2 years ago
- ☆124Updated 2 weeks ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆151Updated 2 years ago
- The reproduced code for Google's SoundStorm☆269Updated 2 years ago
- Preprocess Audio for training☆366Updated 2 weeks ago