shenduldh / CosyVoice2-Lightning
Lightning-responsive CosyVoice2 streaming API based on FastAPI.
☆15 Updated 3 months ago
Alternatives and similar repositories for CosyVoice2-Lightning
Users who are interested in CosyVoice2-Lightning are comparing it to the libraries listed below.
- F5-TTS inference acceleration, roughly 4x faster! ☆106 Updated 7 months ago
- SenseVoice-python: An enterprise-grade open-source multilingual ASR system from FunASR, with onnxruntime ☆98 Updated 11 months ago
- ☆28 Updated last year
- simple and fast wav2lip using onnx models for face-detection and inference. Easy installation ☆25 Updated 10 months ago
- Paraformer (Chinese ASR) online ONNX Runtime for Python ☆50 Updated last year
- Just a suturing monster project. ☆41 Updated last year
- Audio tools for Python ☆15 Updated 9 months ago
- PersonaTalk Hack ☆14 Updated 7 months ago
- Finding the most similar timbre (tone color) in a large collection of audio. ☆13 Updated last year
- IndexTTS Fine-tuning notebooks ☆53 Updated 2 months ago
- Utilizes ONNX Runtime to transcribe audio into text. ☆45 Updated 2 weeks ago
- TTS application based on ModelScope KAN-TTS ☆43 Updated last year
- Utilizes ONNX Runtime for speech activity detection. ☆27 Updated last week
- Speech recognition model converted from PyTorch to ONNX to MNN, with deployment implemented in C++ ☆72 Updated 2 years ago
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (such as fish/cosy/chattts). ☆83 Updated last week
- ☆55 Updated last month
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very … ☆41 Updated 8 months ago
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation ☆18 Updated 2 years ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR). ☆75 Updated last month
- Reproduction of the new paper by the Wav2Lip authors ☆20 Updated 2 years ago
- Running the F5-TTS by ONNX Runtime ☆173 Updated 2 weeks ago
- An enterprise-grade Voice Activity Detector from ModelScope and FunASR. ☆108 Updated 2 years ago
- LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models ☆113 Updated 2 months ago
- Splitting the ASR probability distribution results into Chinese pinyin, so as to extract more effective features for Chinese speech… ☆21 Updated 2 years ago
- MNN ASR demo. ☆23 Updated 5 months ago
- High-resolution clothes swapping, virtual try-on, and object replacement for people in 4K video ☆54 Updated 2 years ago
- A Tiny Project For ASR model training and Deployment ☆27 Updated 2 years ago
- DINet-based inference service, running inference on video streams and videos ☆16 Updated last year
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released ☆12 Updated 3 years ago
- ☆15 Updated last year