lukeewin / faster_whisper_streaming
This is a project focused on Faster Whisper, a streaming speech recognition project.
☆15Updated 4 months ago
Alternatives and similar repositories for faster_whisper_streaming:
Users that are interested in faster_whisper_streaming are comparing it to the libraries listed below
- 这是一款基于FunASR实现的说话人分离的GUI程序☆29Updated last week
- Bert-VITS2 onnx推理版本☆40Updated 9 months ago
- Bert-VITS2项目bug多且教程不友好。本proj尽可能修复了Bert-vits2项目的bug,并且可一键启动训练。仅需50条目标说话人语音,获得稳定、快速的TTS模型。☆44Updated 4 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆161Updated 6 months ago
- Pseudo Streaming SenseVoice with Hotwords☆171Updated last month
- Python Wrapper of Silero VAD☆48Updated last month
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆21Updated 9 months ago
- GPT-SoVITS2☆210Updated 5 months ago
- A Bob plugin that calls self-deployed Cosyvoice service to achieve TTS.☆24Updated 5 months ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆57Updated last month
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆89Updated 10 months ago
- paraformer(chinense asr) online onnx runtime for python☆40Updated 10 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆84Updated last week
- 这个项目是数据预处理。第一步是对获取到的音频做处理,结合Funasr的时间戳去掉空背景音。也包含了喂给BERT前的label☆15Updated 7 months ago
- VC Without Retrain!☆113Updated 9 months ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆96Updated 4 months ago
- Port of Funasr's Sense-voice model in C/C++☆229Updated last week
- ☆127Updated 4 months ago
- ☆99Updated last year
- ☆44Updated last year
- text to speech using autoregressive transformer and VITS☆234Updated 9 months ago
- ☆22Updated this week
- 本项目是基于Pytorch的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复杂的流程,直接一键训练和生成,大大降低了学习门槛。☆45Updated 5 months ago
- Unoffical implementation of Megatts2☆274Updated 10 months ago
- 基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。☆194Updated 2 years ago
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆28Updated 3 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆71Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆80Updated 4 months ago
- ☆88Updated 3 weeks ago
- Python Wrapper for RnNoise v0.2☆23Updated last month