简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目
☆42Sep 23, 2024Updated last year
Alternatives and similar repositories for SenseVoice-Real-Time
Users that are interested in SenseVoice-Real-Time are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于Fastrtc、Ollama、FunASR和MegaTTS的大模型中文语音实时对话应用☆22Apr 26, 2025Updated last year
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- 使用fastrtc框架调用qwen-2.5-omni-realtime实现实时语音、视频等☆14Jun 27, 2025Updated last year
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆26Jun 16, 2026Updated 2 weeks ago
- This is a project focused on Faster Whisper, a streaming speech recognition project.☆18Sep 27, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- paraformer web server build with sanic☆28May 3, 2023Updated 3 years ago
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- ☆53Nov 26, 2023Updated 2 years ago
- An ASR API server for FunASR☆54Apr 19, 2026Updated 2 months ago
- Unity AudioDance☆10Sep 12, 2020Updated 5 years ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆47Oct 28, 2024Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆113Jun 12, 2026Updated 3 weeks ago
- Scratching lottery ticket. View prototype☆13Jun 23, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Pseudo Streaming SenseVoice with Hotwords☆458Jun 15, 2026Updated 2 weeks ago
- tts fronted-end☆11Dec 19, 2018Updated 7 years ago
- 通过 Python 驱动 ADB , 实现自动刷抖音,并将无水印视频下载至本地.☆13Jul 9, 2021Updated 4 years ago
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆36Apr 22, 2026Updated 2 months ago
- MP3, MP4(m4a, m4b), FLAC and OGG(Vorbis, Opus) meta data reader and writer for go☆11Dec 29, 2024Updated last year
- WebUI for using SmolDocling-256M-preview☆14Mar 21, 2025Updated last year
- Simple optimized UI effect. Inspired by Marerial Design☆16Apr 26, 2023Updated 3 years ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆18Jun 27, 2025Updated last year
- ☆12Jun 14, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A demonstration custom node that showcases how to integrate Vue as a frontend framework within ComfyUI, complete with PrimeVue components…☆21Dec 14, 2025Updated 6 months ago
- ☆13Oct 27, 2021Updated 4 years ago
- ☆29Oct 1, 2023Updated 2 years ago
- Sync Lip in Unity by Wav2Lip☆11Jan 14, 2021Updated 5 years ago
- ☆11Dec 24, 2024Updated last year
- ☆12Jul 11, 2024Updated last year
- <综合> Funasr语音识别,调用Qwen大模型回答,通过GPTSovits输出语音的ai程序,其中调用模型还是在线,后续将添加离线大模型☆13Nov 30, 2024Updated last year
- 完全独立编译 AEC, AGC, NS, VAD in WebRTC☆22Jul 8, 2019Updated 6 years ago
- ESPTool for Node.js, based on esptool.py☆10Aug 12, 2015Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Live2d整合科大讯飞在线TTS,播放并实时口型同步案例☆14Feb 1, 2024Updated 2 years ago
- 声纹识别☆31Dec 4, 2023Updated 2 years ago
- Access any internal service from your browser. No VPN, no client, one command. Shield CLI is a browser-first internal service gateway — S…☆37Jun 18, 2026Updated 2 weeks ago
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 11 months ago
- WebAssembly port of Rhubarb Lip Sync - an advanced lip sync tool that automatically creates mouth animation from audio files. Perfect for…☆30Sep 3, 2025Updated 10 months ago
- FunASR安卓端侧离线版本2pass全模式☆15Sep 4, 2023Updated 2 years ago