litongjava / whisper-cpp-server
whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++
☆60Updated 10 months ago
Alternatives and similar repositories for whisper-cpp-server:
Users that are interested in whisper-cpp-server are comparing it to the libraries listed below
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆61Updated last year
- FastAPI service on top of WhisperX☆76Updated this week
- streaming speech to text server using Whisper☆90Updated last year
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆84Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆25Updated 7 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆146Updated last year
- Running the F5-TTS by ONNX Runtime☆129Updated this week
- On-device voice activity detection (VAD) powered by deep learning☆202Updated last week
- Live-Transcription (STT) with Whisper PoC☆175Updated 9 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆84Updated 2 months ago
- ONNX Inference of Pyannote Segmentation☆81Updated 3 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- A FreeSWITCH module to interface to your speech recognition server over websocket☆32Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆59Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆71Updated 9 months ago
- On-device streaming text-to-speech engine powered by deep learning☆73Updated last week
- WhisperX Service love docker!☆13Updated 7 months ago
- ☆318Updated 8 months ago
- On-device speaker diarization powered by deep learning☆39Updated last week
- Utilizes ONNX Runtime to transcribe audio into text.☆18Updated last month
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆86Updated 6 months ago
- A curated list of awesome voice activity detection☆43Updated 4 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆117Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 11 months ago
- Open models for Coqui STT☆134Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆19Updated last year
- A lightweight end-to-end text-to-speech model☆110Updated last month
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year