DakeQQ / F5-TTS-ONNX
Running the F5-TTS by ONNX Runtime
☆148Updated last week
Alternatives and similar repositories for F5-TTS-ONNX:
Users that are interested in F5-TTS-ONNX are comparing it to the libraries listed below
- F5-TTS 推理加速,速度提升约4倍!☆80Updated 4 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆176Updated 7 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆90Updated 7 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆405Updated 7 months ago
- 使用vllm加速cosyvoice2的推理☆252Updated last week
- A lightweight end-to-end text-to-speech model☆112Updated 2 months ago
- VC Without Retrain!☆121Updated last year
- G2P☆227Updated this week
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆96Updated last month
- ONNX Inference of Pyannote Segmentation☆86Updated 4 months ago
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆203Updated 3 months ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆262Updated last month
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆291Updated 3 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆99Updated 2 years ago
- Bert-VITS2 onnx推理版本☆41Updated last year
- Running the F5-TTS by ONNX Runtime standalone with GUI☆18Updated 4 months ago
- ChatTTS is a generative speech model for daily dialogue.☆22Updated 3 months ago
- Full version of wav2lip-onnx including face alignment and face enhancement and more...☆107Updated last week
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆93Updated 4 months ago
- text to speech using autoregressive transformer and VITS☆238Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆34Updated this week
- GPT-SoVITS2☆216Updated 9 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆67Updated last year
- Unoffical implementation of Megatts2☆283Updated last year
- The reproduced code for Google's SoundStorm☆265Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch☆126Updated 5 months ago
- ☆106Updated 3 weeks ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆72Updated last week
- ☆223Updated last month
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆162Updated 11 months ago