esnya / realtime-whisper
ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers
☆24Updated last month
Alternatives and similar repositories for realtime-whisper:
Users that are interested in realtime-whisper are comparing it to the libraries listed below
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆79Updated 3 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆140Updated this week
- FastAPI service on top of WhisperX☆62Updated last week
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆71Updated last year
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year
- ONNX Inference of Pyannote Segmentation☆81Updated 3 weeks ago
- A lightweight end-to-end text-to-speech model☆99Updated 3 weeks ago
- Running the F5-TTS by ONNX Runtime☆81Updated this week
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆63Updated 3 months ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆21Updated 2 months ago
- Pseudo Streaming SenseVoice with Hotwords☆161Updated last month
- Speech Diarization for scrum automation☆101Updated last year
- Live-Transcription (STT) with Whisper PoC☆165Updated 7 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆110Updated 11 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆378Updated 4 months ago
- flow mirror models from JZX AI Labs☆43Updated 3 months ago
- A toolkit for speaker diarization.☆164Updated 2 months ago
- ONNX implementation of Whisper. PyTorch free.☆88Updated 2 months ago
- 一个简单的音频降噪工具,提高web UI界面和api接口☆19Updated 2 months ago
- 中文标点符号模型,可以给文本添加标点符号。☆134Updated 3 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction☆83Updated 8 months ago
- a gradio webui for faster whisper☆245Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆91Updated 8 months ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆31Updated last year
- whisper.cpp bindings for python☆83Updated last year
- ubuntu 系统下 GLM-4-Voice 部署经验分享☆19Updated 2 months ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆67Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- An implementation of MeloTTS by onnxruntime☆16Updated 2 months ago