yrom / finetune-index-ttsLinks
IndexTTS Fine-tuning notebooks
☆30Updated 3 weeks ago
Alternatives and similar repositories for finetune-index-tts
Users that are interested in finetune-index-tts are comparing it to the libraries listed below
Sorting:
- F5-TTS 推理加速,速度提升约4倍!☆100Updated 6 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆20Updated 2 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆224Updated last week
- ☆21Updated 8 months ago
- paraformer(chinense asr) online onnx runtime for python☆46Updated last year
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆99Updated 6 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆103Updated 3 months ago
- ☆65Updated last year
- TTS appalication based on modelscope KAN-TTS☆43Updated last year
- ☆85Updated last month
- Huawei Grad-TTS for Chinese☆50Updated last year
- Unoffical implementation of Megatts2☆286Updated last year
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆73Updated 3 months ago
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆214Updated 6 months ago
- ☆57Updated last year
- ☆38Updated last month
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆71Updated 2 years ago
- ☆13Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆102Updated 2 years ago
- ☆67Updated 2 weeks ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆77Updated last week
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Updated 11 months ago
- Forced Alignment-MFA☆40Updated 3 years ago
- Python Wrapper of Silero VAD☆56Updated 2 months ago
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆24Updated last year
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆147Updated last year
- Bert-VITS2项目bug多且教程不友好。本proj尽可能修复了Bert-vits2项目的bug,并且可一键启动训练。仅需50条目标说话人语音,获得稳定、快速的TTS模型。☆61Updated 4 months ago
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆202Updated 4 months ago
- ☆32Updated 3 years ago
- 基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。☆195Updated 2 years ago