litongjava / whisper-cpp-server
whisper-cpp-server: Real-time speech recognition with OpenAI's Whisper model in C/C++
☆56Updated 8 months ago
Alternatives and similar repositories for whisper-cpp-server:
Users who are interested in whisper-cpp-server are comparing it to the libraries listed below:
- FastAPI service on top of WhisperX☆63Updated last week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆90Updated 8 months ago
- streaming speech to text server using Whisper☆84Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆57Updated last year
- Streaming ASR and TTS based on FastAPI + sherpa-onnx☆61Updated 3 months ago
- ez audio transcription tool with flexible processing and post-processing options☆140Updated 11 months ago
- ONNX Inference of Pyannote Segmentation☆81Updated 3 weeks ago
- ASR using the OpenAI-compatible API `v1/audio/transcriptions`, as offered by Groq and SiliconFlow☆25Updated 4 months ago
- web based editor for subtitles and transcripts☆119Updated 5 months ago
- Speech Diarization for scrum automation☆101Updated last year
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆31Updated last year
- WhisperX Service love docker!☆13Updated 5 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆108Updated last year
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription☆61Updated 7 months ago
- On-device voice activity detection (VAD) powered by deep learning☆190Updated this week
- SenseVoice-python: An enterprise-grade open-source multi-language ASR system from FunASR, running on onnxruntime☆79Updated 3 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆59Updated last year
- Open models for Coqui STT☆127Updated last year
- Live-Transcription (STT) with Whisper PoC☆165Updated 7 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- A FreeSWITCH module to interface to your speech recognition server over websocket☆29Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆110Updated 11 months ago
- ☆308Updated 6 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆22Updated this week
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆83Updated this week
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆28Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆21Updated 5 months ago
- C++ version of the pyannote audio speaker diarization pipeline☆19Updated 11 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆122Updated this week
- An implementation of MeloTTS using onnxruntime☆15Updated 2 months ago