ultrasev / stream-whisperLinks
基于 faster-whisper 的伪实时语音转写服务
☆219Updated 2 months ago
Alternatives and similar repositories for stream-whisper
Users that are interested in stream-whisper are comparing it to the libraries listed below
Sorting:
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆400Updated 6 months ago
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆471Updated 8 months ago
- 一个用于CosyVoice的api接口项目☆293Updated 5 months ago
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆88Updated 10 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆47Updated last year
- a gradio webui for faster whisper☆268Updated 2 years ago
- 适用于 GPT-SoVITS 的api调用接口☆290Updated last year
- Pseudo Streaming SenseVoice with Hotwords☆303Updated 3 months ago
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆67Updated 10 months ago
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆224Updated last week
- ☆361Updated 11 months ago
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆280Updated last month
- 【脱离复杂的环境配置和整合包,极简配置推理服务】从GPT-SoVITS项目里面提取出来的,纯粹的推理服务方案。☆292Updated last year
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆445Updated 8 months ago
- CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API)☆151Updated 3 months ago
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆103Updated last year
- 10000 chatTTS voices !chatTTS 音色库,再也不为音色抽卡烦恼啦。这是我第一个项目,熬夜龟速生产10000条音色并上传Github,给点鼓励呗哈!主域名:https://www.TTSlist.com 备用:http://ttslist.aiqb…☆187Updated 11 months ago
- ChatTTS HTTP API☆55Updated last year
- Step-by-step Jupyter notebook tutorials for ChatTTS☆164Updated last year
- Port of Funasr's Sense-voice model in C/C++☆396Updated 2 weeks ago
- ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview☆678Updated last year
- 基于Faster-whisper和modelscope一键生成双语字幕,双语字幕生成器,基于离线大模型,Generate bilingual subtitles with one click based on Faster-whisper and modelscope. O…☆391Updated 7 months ago
- 文本语料转训练集工具,txt转dataset☆93Updated last year
- Added vLLM support to IndexTTS for faster inference.☆287Updated this week
- 这是一款基于FunASR实现的说话人分离的GUI程序☆100Updated 4 months ago
- 低成本的简单基于live2d TTS文字转语音和大模型聊天的直播解决方案☆256Updated last year
- ☆26Updated 4 months ago
- 一个中文语音转文字项目,封装自FireRedASR☆64Updated 4 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆181Updated last year
- Fast-TTS 是一个基于异步框架的文本到语音转换(TTS)生成器项目。该项目利用了异步编程技术来高效处理请求和响应,实现了快速、秒级的流式生成长文本语音播放服务。Fast-TTS 可以快速地将长文本转换为语音流,并实时播放,适用于多种应用场景,如语音合成、智能助手、内容…☆38Updated 7 months ago