litongjava / whisper-cpp-server
whisper-cpp-server: Real-time speech recognition with OpenAI's Whisper model in C/C++
☆68 Updated last year
Alternatives and similar repositories for whisper-cpp-server
Users who are interested in whisper-cpp-server are comparing it to the libraries listed below
- Streaming speech-to-text server using Whisper ☆94 Updated 2 years ago
- FastAPI service on top of WhisperX ☆120 Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆96 Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆117 Updated 2 years ago
- Self-hosted, high-quality voice recognition for de-googled Android using Whisper. Like Siri or OK Google. ☆64 Updated last year
- faster-whisper as a serverless endpoint ☆109 Updated 2 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆68 Updated 3 weeks ago
- Live-Transcription (STT) with Whisper PoC ☆189 Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆136 Updated 2 weeks ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆120 Updated last year
- ez audio transcription tool with flexible processing and post-processing options ☆156 Updated last year
- Open models for Coqui STT ☆141 Updated 2 years ago
- ☆22 Updated 6 months ago
- Running the F5-TTS by ONNX Runtime ☆170 Updated last week
- Self-contained voice activity detector ☆29 Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆223 Updated 4 months ago
- On-device streaming text-to-speech engine powered by deep learning ☆102 Updated 2 weeks ago
- ☆337 Updated last year
- ☆27 Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆223 Updated this week
- An API to transcribe audio with OpenAI's Whisper Large v3! ☆296 Updated 8 months ago
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆86 Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆161 Updated last year
- Streaming ASR and TTS based on FastAPI + sherpa-onnx ☆135 Updated 3 months ago
- Web-based editor for subtitles and transcripts ☆137 Updated 11 months ago
- C++ library for converting text to phonemes for Piper ☆128 Updated 3 weeks ago
- A ggml (C++) re-implementation of tortoise-tts ☆188 Updated 11 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆119 Updated last year
- Open source inference code for Rev's model ☆416 Updated 3 months ago
- WhisperX API implementation ☆27 Updated last year