litongjava / whisper-cpp-serverLinks
whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++
☆66Updated last year
Alternatives and similar repositories for whisper-cpp-server
Users that are interested in whisper-cpp-server are comparing it to the libraries listed below
Sorting:
- streaming speech to text server using Whisper☆92Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- FastAPI service on top of WhisperX☆101Updated this week
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆56Updated last month
- Running the F5-TTS by ONNX Runtime☆155Updated 2 weeks ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆114Updated last month
- ONNX Inference of Pyannote Segmentation☆90Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆27Updated 10 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆115Updated last week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- ☆329Updated 11 months ago
- On-device voice activity detection (VAD) powered by deep learning☆216Updated 3 weeks ago
- Pybind11 bindings for Whisper.cpp☆57Updated last month
- ☆22Updated 4 months ago
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs☆43Updated last year
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆21Updated 9 months ago
- ez audio transcription tool with flexible processing and post-processing options☆150Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆53Updated this week
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆93Updated 8 months ago
- web based editor for subtitles and transcripts☆133Updated 9 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- A lightweight end-to-end text-to-speech model☆115Updated 3 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆116Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Whisperx API implementation☆27Updated last year
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆62Updated 7 months ago
- Port of Funasr's Paraformer model in C/C++☆31Updated 11 months ago
- Speech Diarization for scrum automation☆105Updated last year