lukeewin / faster_whisper_streamingLinks
This is a project focused on Faster Whisper, a streaming speech recognition project.
☆18Updated last year
Alternatives and similar repositories for faster_whisper_streaming
Users that are interested in faster_whisper_streaming are comparing it to the libraries listed below
Sorting:
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆184Updated last year
- Bert-VITS2 onnx推理版本☆43Updated last year
- 这是一款基于FunASR实现的说话人分离的GUI程序☆146Updated last week
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆29Updated last year
- Pseudo Streaming SenseVoice with Hotwords☆406Updated 9 months ago
- GPT-SoVITS2☆229Updated last year
- F5-TTS 推理加速,速度提升约4倍!☆120Updated 11 months ago
- Convenient for developers to call inference models from version v1 to v3 through API, supporting streaming transmission and specified typ…☆44Updated 9 months ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆91Updated 3 weeks ago
- CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API)☆177Updated 9 months ago
- Bert-VITS2项目bug多且教程不友好。本proj尽可能修复了Bert-vits2项目的bug,并且可一键启动训练。仅需50条目标说话人语音,获得稳定、快速的TTS模型。☆65Updated 4 months ago
- 一个简单的音频降噪工具,提高web UI界面和api接口☆44Updated last year
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆46Updated last year
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆106Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆108Updated 2 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆120Updated 2 years ago
- 使用vllm加速cosyvoice2的推理☆461Updated 7 months ago
- VC Without Retrain!☆128Updated last year
- Running the F5-TTS by ONNX Runtime☆184Updated last month
- Python Wrapper of Silero VAD☆63Updated 7 months ago
- 一个用于CosyVoice的api接口项目☆325Updated 3 months ago
- ☆142Updated 2 years ago
- 本项目是基于Pytorch的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复杂的流程,直接一键训练和生成,大大降低了学习门槛。☆54Updated last year
- ☆49Updated 2 years ago
- ☆16Updated this week
- ☆12Updated last year
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆338Updated last week
- 基于 faster-whisper 的伪实时语音转写服务☆232Updated 7 months ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆144Updated 4 months ago
- Utilizes ONNX Runtime for speech activity detection.☆36Updated last week