esnya / realtime-whisper
ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and transformers
☆32Updated 8 months ago
Alternatives and similar repositories for realtime-whisper
Users that are interested in realtime-whisper are comparing it to the libraries listed below
- SenseVoice-python: An enterprise-grade open-source multilingual ASR system from FunASR, running on onnxruntime☆99Updated 11 months ago
- Streaming ASR and TTS based on FastAPI + sherpa-onnx☆145Updated 4 months ago
- An enterprise-grade Voice Activity Detector from ModelScope and FunASR.☆111Updated 2 years ago
- We Speech Transcript based on LLM, in 300 lines of code.☆176Updated 2 months ago
- Running F5-TTS with ONNX Runtime☆177Updated 3 weeks ago
- Utilizes ONNX Runtime to transcribe audio into text.☆50Updated this week
- A lightweight end-to-end text-to-speech model☆119Updated 6 months ago
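- An API project for SenseVoice that outputs subtitles with timestamps☆39Updated 10 months ago
- A small project that combines VAD, a voiceprint lock, and SenseVoice in a simple implementation of near-real-time speech transcription☆35Updated 11 months ago
- Speech recognition built on FunASR, with a standard version and an ONNX version (recommended).☆43Updated 10 months ago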
- ☆201Updated 11 months ago
- An open-source chatbot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).☆68Updated this week
- ChatTTS HTTP API☆55Updated last year
- ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR).☆76Updated this week
- A simple API version of FunASR speech-to-text (FunASR + FastAPI), easy to deploy on a server☆13Updated last year
- Port of FunASR's Paraformer model in C/C++☆34Updated last year
- livekit agent plugins☆19Updated 2 weeks ago
- Ultra-fast Mandarin Chinese TTS☆121Updated 4 years ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆170Updated 6 months ago
- A toolkit for speaker diarization.☆279Updated 3 weeks ago
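- Speech recognition API with real-time recognition and offline recognition of uploaded long audio; supports real-time transcription and simultaneous interpretation for Chinese, English, and other languages from as many as 100 countries☆79Updated 8 months ago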
- flow mirror models from JZX AI Labs☆44Updated 11 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆419Updated 11 months ago
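- Continues training on Biaobei data and improves the original FastSpeech2 model by introducing prosody representation and prosody prediction modules, making Chinese pronunciation more vivid and rhythmic☆270Updated last year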
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆66Updated 2 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated 2 years ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 5 months ago
- Speech Diarization for scrum automation☆111Updated 2 years ago
- 🌻 VITS ONNX TTS server designed for fast inference 🔥☆128Updated 7 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year