Gloridust / whisper_streaming_CN

Whisper realtime streaming for long speech-to-text transcription and translation

☆52

Alternatives and similar repositories for whisper_streaming_CN

Users who are interested in whisper_streaming_CN are comparing it to the libraries listed below.

ultrasev / stream-whisper
A pseudo-real-time speech transcription service based on faster-whisper
☆222 Updated 3 months ago
Ikaros-521 / RealtimeSTT_LLM_TTS
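Real-time STT connected to the OpenAI API/Zhipu AI (streaming LLM) and GPT-SOVITS/Edge-TTS; services are called across networks through a web page to achieve real-time conversation
☆406 Updated 7 months ago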
0x5446 / api4sensevoice
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…
☆481 Updated 9 months ago
LuckLittleBoy / SenseVoice-OneApi
An API release based on the funasr version of SenseVoice that integrates seamlessly with oneapi
☆69 Updated 11 months ago
HG-ha / SenseVoice-Api
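A FastAPI wrapper for Alibaba's SenseVoice, released with onnx for a smaller footprint, bundled with a quantized model, with GPU support. Supports speech recognition from files at a URL.
☆95 Updated 11 months ago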
aliyun / alibabacloud-bailian-speech-demo
Sample Repository for the AlibabaCloud Bailian Speech SDK
☆247 Updated last week
shuaijiang / Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…
☆287 Updated 2 months ago
jianchang512 / cosyvoice-api
An API project for CosyVoice
☆304 Updated 6 months ago
soulteary / dify-with-qwen-vl
Video understanding: the Qwen multimodal video model & Dify
☆63 Updated 11 months ago
v3ucn / llama3-txt2json-dataset-maker
A tool for converting text corpora into training sets (txt to dataset)
☆93 Updated last year
CyberWon / ChatTTS-API
ChatTTS HTTP API
☆55 Updated last year
jianchang512 / fireredasr-ui
A Chinese speech-to-text project wrapping FireRedASR
☆65 Updated 5 months ago
pengzhendong / streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
☆323 Updated 4 months ago
ycyy / faster-whisper-webui
A gradio webui for faster whisper
☆270 Updated 2 years ago
Ninot1Quyi / Qwen2.5-Omni-multimodal-chat
A real-time voice conversation system based on Tongyi Qianwen Qwen2.5-Omni, using the online API service, with support for real-time voice interaction, dynamic voice activity detection, and streaming audio processing, …
☆62 Updated 2 months ago
Kedreamix / ChatTTS
TTS
☆49 Updated last year
diudiu62 / CosyVoice-api
☆28 Updated 5 months ago
v3ucn / Unsloth-Windows-fineTuning-Qwen2
Fine-tuning the Qwen2 large model with the Unsloth framework on native Windows (not WSL)
☆61 Updated last year
journey-ad / CosyVoice2-Ex
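CosyVoice2 feature extensions (pretrained voice inference / 3s rapid cloning / natural-language control / automatic recognition / voice model saving / API)
☆156 Updated 4 months ago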
mMrBun / AIPC
☆58 Updated 9 months ago
gan / glm4v-assistant
Sample GLM4V + ChatTTS AI assistant
☆85 Updated last year
zmeet-ai / asr_demo
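A speech recognition API covering real-time speech and offline recognition of uploaded long audio, supporting real-time transcription and simultaneous interpretation for Chinese, English, and the languages of up to 100 countries
☆79 Updated 7 months ago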
win4r / VideoFinder-Llama3.2-vision-Ollama
VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objec…
☆157 Updated 9 months ago
Ikaros-521 / GraphRAG-Ollama-UI
A merged version of GraphRAG-Ollama-UI + GraphRAG4OpenWebUI (with a gradio webui for configuring and generating the RAG index, and fastapi providing the RAG API service)
☆109 Updated 11 months ago
uniai-lab / GLM-API
Customize APIs from GLM, ChatGLM
☆68 Updated 6 months ago
2DIPW / gpt_sovits_infer_with_emotion
A GPT-SoVITS inference demo that automatically switches reference audio based on sentiment analysis of Chinese text
☆104 Updated last year
CrazyBoyM / phi3-Chinese
Repository of Chinese post-trained Phi3 models
☆321 Updated 8 months ago
donzell888 / fast-tts
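Fast-TTS is a text-to-speech (TTS) generator project built on an asynchronous framework. It uses asynchronous programming to handle requests and responses efficiently, providing a fast streaming service that generates and plays speech for long text within seconds. Fast-TTS can quickly convert long text into a speech stream and play it in real time, and suits many scenarios such as speech synthesis, intelligent assistants, content…
☆40 Updated 8 months ago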
volcengine / rtc-aigc-demo
RTC AIGC Demo
☆181 Updated 2 weeks ago
HaujetZhao / FunASR-Online-Paraformer-Test
☆48 Updated last year