ywh-my / Bert-VITS2-FixBugLinks

Bert-VITS2项目bug多且教程不友好。本proj尽可能修复了Bert-vits2项目的bug，并且可一键启动训练。仅需50条目标说话人语音，获得稳定、快速的TTS模型。

☆63

Alternatives and similar repositories for Bert-VITS2-FixBug

Users that are interested in Bert-VITS2-FixBug are comparing it to the libraries listed below

Sorting:

WGS-note / F5_TTS_Faster
F5-TTS 推理加速，速度提升约4倍！
☆106Updated 7 months ago
qi-hua / async_cosyvoice
使用vllm加速cosyvoice2的推理
☆397Updated 3 months ago
huahuahuage / Bert-VITS2-Speech
Bert-VITS2 onnx推理版本
☆42Updated last year
xinchen-ai / Westlake-Omni
☆201Updated 11 months ago
ZaVang / GPT-SoVits
重构GPT-SOVITS的项目，重写了部分代码，优化了webui的使用以及增加了api调用
☆29Updated 8 months ago
hexisyztem / CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆20Updated 4 months ago
SapphireLab / Sapphire-TTS-Collection
☆40Updated 2 months ago
duj12 / ASR-2Pass
ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).
☆75Updated last month
lovemefan / fsmn-vad
A enterprise-grade Voice Activity Detector from modelscope and funasr.
☆107Updated 2 years ago
YoMio-Tech-Inc / GPT-SoVITS2
GPT-SoVITS2
☆224Updated last year
chenyangMl / keyword-spot
端到端语音唤醒工具箱，从模型训练到模型推理。
☆125Updated 2 weeks ago
ScottishFold007 / TTSAudioNormalizer
TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…
☆102Updated 8 months ago
pengzhendong / g2p-mix
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
☆107Updated 5 months ago
modelscope / kws-training-suite
☆132Updated 2 years ago
MooreThreads / MooER
MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…
☆217Updated 7 months ago
xiaomingnio / kantts
TTS appalication based on modelscope KAN-TTS
☆43Updated last year
Fatfish588 / Dataset_Generator_For_VITS
基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…
☆55Updated last year
Executedone / Chinese-FastSpeech2
基于标贝数据继续训练，同时对原本的FastSpeech2模型做了改进，引入了韵律表征以及韵律预测模块，使中文发音更生动且富有节奏
☆269Updated last year
xieyuankun / VITS-chinese-finetune
语音合成VITS 纯中文微调
☆11Updated 2 years ago
JimmyMa99 / train-higgs-audio
Text-audio foundation model from Boson AI
☆77Updated last week
FunAudioLLM / CV3-Eval
☆98Updated last month
MaxMax2016 / Grad-TTS-Chinese
Huawei Grad-TTS for Chinese
☆51Updated last year
pengzhendong / streaming-ChatTTS
☆21Updated 9 months ago
baichuan-inc / Baichuan-Audio
Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction
☆207Updated 5 months ago
peilongchencc / My-FunASR
基于FunASR实现语音识别，包含常规版和ONNX版(推荐)。
☆42Updated 10 months ago
pengzhendong / streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
☆335Updated 5 months ago
pengzhendong / pysilero
Python Wrapper of Silero VAD
☆59Updated 3 months ago
ishine / vc-lm
将任意人的音色转换为成千上万种不同音色
☆30Updated 2 years ago
jingzhunxue / flow_mirror
flow mirror models from JZX AI Labs
☆44Updated 10 months ago
Zz-ww / VITS-BigVGAN-SpanPSP-Chinese
基于PyTorch的VITS-BigVGAN的tts中文模型，加入韵律预测模型。
☆196Updated 2 years ago