funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上
☆13Aug 10, 2024Updated last year
Alternatives and similar repositories for funasr-fastapi
Users that are interested in funasr-fastapi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于funasr开源项目和模型,快速搭建语音转文字的api服务☆33Dec 15, 2025Updated 5 months ago
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆26Feb 12, 2026Updated 3 months ago
- ☆45Jan 20, 2025Updated last year
- 基于FastAPI的语音服务系统,集成语音合成(TTS)和语音识别(STT)功能。使用CosyVoice2作为TTS引擎,FunASR作为STT引擎,支持零样本语音克隆、流式输出、多种语言识别等高级功能。☆20Apr 1, 2025Updated last year
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆42Sep 23, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 经过轻微修改的彩虹版易支付,支持usdt结算☆16May 24, 2023Updated 3 years ago
- A lightweight demo of FunASR-Nano using ONNX runtime.☆78Feb 25, 2026Updated 2 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- paraformer web server build with sanic☆28May 3, 2023Updated 3 years ago
- ☆49Nov 26, 2023Updated 2 years ago
- An ASR API server for FunASR☆54Apr 19, 2026Updated last month
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆112Oct 6, 2025Updated 7 months ago
- 基于roop与codeFormer的换脸一体脚本☆20Apr 9, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆36Apr 22, 2026Updated last month
- Access any internal service from your browser. No VPN, no client, one command. Shield CLI is a browser-first internal service gateway — S…☆31Apr 24, 2026Updated last month
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- Compute WER and SER for speech recognition evaluation☆26Mar 18, 2026Updated 2 months ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆18Jun 27, 2025Updated 10 months ago
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- ☆14Aug 9, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 3 years ago
- <综合> Funasr语音识别,调用Qwen大模型回答,通过GPTSovits输出语音的ai程序,其中调用模型还是在线,后续将添加离线大模型☆13Nov 30, 2024Updated last year
- 重写LMAX的Disruptor,更好的接口,更好的扩展性☆10Mar 20, 2026Updated 2 months ago
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- Yanjie — An English Speaking Learning Assistant Based on InternLM, aims to break the traditional interaction boundaries and eliminate th…☆16Oct 9, 2024Updated last year
- 基于Gradio开发的ChatGPT聊天应用,可以文字 或 语音对话,发送的音频通过OpenAI的STT转文本后,再通过ChatGPT生成回复,回复的内容通过OpenAI TTS合成后返回并自动播放,实现语音聊天功能。☆35Feb 18, 2024Updated 2 years ago
- 仿微信 长按表情弹出表情预览弹窗/输入按钮切换☆10Mar 1, 2016Updated 10 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 9 months ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆45Oct 28, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- FunASR安卓端侧离线版本2pass全模式☆15Sep 4, 2023Updated 2 years ago
- 全自动大模型llm训练,无需微调知识,门槛极低。极其适合零基础的人使用(目前暂时只支持glm3,未来会增加更多模型)☆19Dec 20, 2025Updated 5 months ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 7 years ago
- TimeSeries Java client for Facebook Beringei. It also includes query service with tags support for metrics.☆10May 13, 2017Updated 9 years ago
- ☆33Feb 28, 2025Updated last year
- Simple voice activity detection (VAD) algorithm in Python☆15Aug 10, 2023Updated 2 years ago
- A Model Context Protocol (MCP) server for generating and editing images using the OpenAI gpt-image-1 model.☆18May 7, 2025Updated last year