简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目
☆42Sep 23, 2024Updated last year
Alternatives and similar repositories for SenseVoice-Real-Time
Users that are interested in SenseVoice-Real-Time are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于Fastrtc、Ollama、FunASR和MegaTTS的大模型中文语音实时对话应用☆21Apr 26, 2025Updated 10 months ago
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- 使用fastrtc框架调用qwen-2.5-omni-realtime实现实时语音、视频等☆12Jun 27, 2025Updated 8 months ago
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆23Feb 12, 2026Updated last month
- This is a project focused on Faster Whisper, a streaming speech recognition project.☆19Sep 27, 2024Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- paraformer(chinense asr) online onnx runtime for python☆54Mar 27, 2024Updated last year
- 基于ultralytics训练的行人跌倒检测模型☆19Jul 10, 2023Updated 2 years ago
- An ASR API server for FunASR☆49Jan 26, 2026Updated last month
- paraformer web server build with sanic☆28May 3, 2023Updated 2 years ago
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- ☆50Nov 26, 2023Updated 2 years ago
- Unity AudioDance☆11Sep 12, 2020Updated 5 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- 用于SenseVoice的api项目,输出带时间戳字幕☆42Oct 28, 2024Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆110Oct 6, 2025Updated 5 months ago
- Scratching lottery ticket. View prototype☆12Jun 23, 2019Updated 6 years ago
- Pseudo Streaming SenseVoice with Hotwords☆437Mar 13, 2025Updated last year
- tts fronted-end☆11Dec 19, 2018Updated 7 years ago
- 通过 Python 驱动 ADB , 实现自动刷抖音,并将无水印视频下载至本地.☆13Jul 9, 2021Updated 4 years ago
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆36Dec 12, 2024Updated last year
- ☆26Dec 2, 2025Updated 3 months ago
- Live2D + ASR + LLM + TTS → Real-time communication + Offline Deployment/Cloud Inference 实时沟通 本地部署/云端推理☆32Apr 21, 2025Updated 11 months ago
- WebUI for using SmolDocling-256M-preview☆13Mar 21, 2025Updated last year
- Compute WER and SER for speech recognition evaluation☆27Updated this week
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆16Jun 27, 2025Updated 8 months ago
- ☆11Jun 14, 2024Updated last year
- A demonstration custom node that showcases how to integrate Vue as a frontend framework within ComfyUI, complete with PrimeVue components…☆18Dec 14, 2025Updated 3 months ago
- ☆28Oct 1, 2023Updated 2 years ago
- WebAssembly port of Rhubarb Lip Sync - an advanced lip sync tool that automatically creates mouth animation from audio files. Perfect for…☆23Sep 3, 2025Updated 6 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆16May 16, 2025Updated 10 months ago
- mysterious ooze☆15Mar 22, 2025Updated last year
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- Sync Lip in Unity by Wav2Lip☆11Jan 14, 2021Updated 5 years ago
- High-performance OCR microservice based on PaddleOCR-VL-0.9B (PaddleOCR-VL-1.5-0.9B) with MinerU-compatible API☆34Jan 30, 2026Updated last month
- ☆12Jul 11, 2024Updated last year
- ☆14Aug 9, 2021Updated 4 years ago
- 基于wenet的短时在线语音识别服务☆11Feb 25, 2023Updated 3 years ago