jundaychan / funasr-fastapi

A simple API version of funasr speech-to-text, built with funasr + fastapi for easy deployment on a server.

☆12

Alternatives and similar repositories for funasr-fastapi

Users that are interested in funasr-fastapi are comparing it to the libraries listed below

jianchang512 / sense-api
An API project for SenseVoice that outputs subtitles with timestamps.
☆38 Updated 9 months ago
HG-ha / SenseVoice-Api
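A FastAPI wrapper for Alibaba's SenseVoice, released in ONNX format for a smaller footprint, with a quantized model included and GPU support. Supports speech recognition from files at a URL.
☆94 Updated 11 months ago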
CyberWon / ChatTTS-API
ChatTTS HTTP API
☆55 Updated last year
jianchang512 / f5-tts-api
An API and web UI project for F5-TTS.
☆61 Updated 7 months ago
v3ucn / zhangyimou_voice_clone_text
A one-click voice cloning and parody text generation project for Zhang Yimou (nicknamed "Guoshi").
☆17 Updated 2 years ago
LuckLittleBoy / SenseVoice-OneApi
An API release based on the funasr version of SenseVoice that integrates seamlessly with oneapi.
☆69 Updated 10 months ago
yanghan-cyber / audio-service
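A FastAPI-based speech service system integrating text-to-speech (TTS) and speech-to-text (STT). It uses CosyVoice2 as the TTS engine and FunASR as the STT engine, and supports advanced features such as zero-shot voice cloning, streaming output, and multilingual recognition.
☆14 Updated 4 months ago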
2DIPW / gpt_sovits_infer_with_emotion
A GPT-SoVITS inference demo that automatically switches reference audio based on sentiment analysis of Chinese text.
☆104 Updated last year
warmshao / ChatTTSPlus
Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment
☆169 Updated 5 months ago
Ikaros-521 / digital_human_video_player
Luoxi digital human video player with an HTTP API. It uses the gradio API to connect to Easy-Wav2Lip, Sadtalker, GeneFacePlusPlus, and MuseTalk, and can also play local videos.
☆167 Updated 9 months ago
ruzhila / voiceapi
Streaming ASR and TTS based on FastAPI + sherpa-onnx
☆134 Updated 3 months ago
zmeet-ai / asr_demo
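A speech recognition API covering real-time speech and offline recognition of uploaded long audio. Supports real-time transcription and simultaneous interpretation for Chinese, English, and the languages of up to 100 countries.
☆79 Updated 7 months ago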
dongdongzi / metahuman-stream
Real-time streaming digital human based on NeRF
☆16 Updated last year
lovemefan / SenseVoice-python
SenseVoice-python: An enterprise-grade open-source multi-language ASR system from the funasr open-source project, with onnxruntime
☆97 Updated 10 months ago
xszyou / fay-android
The app stays resident in the phone's background, so you can keep communicating with the Fay digital human anytime, anywhere.
☆46 Updated 7 months ago
ABexit / Multi-Character-StoryTeller
This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…
☆47 Updated 6 months ago
diudiu62 / CosyVoice-api
☆27 Updated 5 months ago
peilongchencc / My-GLM-4-Voice
Notes on deploying GLM-4-Voice on Ubuntu.
☆19 Updated 9 months ago
foxyear-kyumin / lip_mask
With this code you can customize a digital human avatar on a lightweight server without training a model.
☆105 Updated last year
aixiaoxin123 / mcp_demo_project
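A web UI for MCP. The client can connect to multiple SSE servers, and large models such as openai, deepseek, and qwen are supported. Also includes complete examples of a simple weather-query agent built over stdio and SSE.
☆33 Updated 2 months ago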
ctkindle / -Wave2lip-
Uses the Wave2lip model from the PaddleGAN suite to give the person in a photo a voice and matching lip movements.
☆26 Updated 4 years ago
wangzai23333 / blivedm
Fetches bilibili live-stream danmaku (bullet comments) over the WebSocket protocol.
☆36 Updated last year
Ikaros-521 / voice_talk_chatgpt
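A ChatGPT chat application built with Gradio that supports text or voice conversation. Audio you send is converted to text by OpenAI's STT, ChatGPT generates a reply, and the reply is synthesized with OpenAI TTS, returned, and played automatically, enabling voice chat.
☆35 Updated last year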
RemSynch / SenseVoice-Real-Time
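A small project that combines VAD, a voiceprint lock, and SenseVoice in a simple implementation of near-real-time speech transcription.
☆31 Updated 10 months ago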
v3ucn / sse_tornado6_vuejs3
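Implements Server-sent events, the streaming response protocol used by ChatGPT, with the asynchronous non-blocking framework Tornado6.0 on Python3.10 and the Vue.js3 front-end framework.
☆23 Updated 2 years ago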
soulteary / dify-with-qwen-vl
Video understanding: the Qwen multimodal video model & Dify
☆62 Updated 11 months ago
luler / hello_asr
Quickly set up a speech-to-text API service based on the funasr open-source project and models.
☆23 Updated 2 months ago
v3ucn / llama3-txt2json-dataset-maker
A tool for converting text corpora into training sets (txt to dataset).
☆93 Updated last year
AgentEra / Agently-Talk-to-Control
An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.
☆28 Updated 10 months ago
liu-qingyuan / faster_whisper_gradio
Real time faster whisper gradio
☆26 Updated 9 months ago