fedirz / faster-whisper-server

☆762

Related projects

Alternatives and complementary repositories for faster-whisper-server

collabora / WhisperLive
A nearly-live implementation of OpenAI's Whisper.
☆2,060 Updated 2 weeks ago
ufal / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆2,092 Updated this week
alesaccoia / VoiceStreamAI
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
☆730 Updated last month
lhl / voicechat2
Local SRT/LLM/TTS Voicechat
☆544 Updated last month
aiola-lab / whisper-medusa
Whisper with Medusa heads
☆799 Updated 2 weeks ago
matatonic / openedai-speech
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
☆476 Updated 2 months ago
collabora / WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.
☆1,547 Updated 3 months ago
KoljaB / LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…
☆518 Updated 3 months ago
KoljaB / RealtimeTTS
Converts text to speech in realtime
☆2,023 Updated this week
fixie-ai / ultravox
A fast multimodal LLM for real-time voice
☆1,339 Updated this week
Standard-Intelligence / hertz-dev
first base model for full-duplex conversational audio
☆1,560 Updated last week
idiap / coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆590 Updated this week
coqui-ai / xtts-streaming-server
☆296 Updated 4 months ago
Softcatala / whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
☆916 Updated this week
KoljaB / Linguflex
Command Your World with Voice
☆443 Updated this week
ochen1 / insanely-fast-whisper-cli
The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️
☆322 Updated 5 months ago
Vaibhavs10 / open-tts-tracker
☆1,094 Updated 5 months ago
juanmc2005 / diart
A Python package to build AI-powered real-time audio applications
☆1,090 Updated 4 months ago
shashikg / WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
☆312 Updated 2 months ago
ricky0123 / vad
Voice activity detector (VAD) for the browser with a simple API
☆893 Updated last week
jim60105 / docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …
☆178 Updated 2 months ago
janhq / ichigo
Local realtime voice AI
☆1,946 Updated this week
JuergenFleiss / aTrain
A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod…
☆348 Updated this week
facebookresearch / spiritlm
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
☆781 Updated 3 weeks ago
edwko / OuteTTS
Interface for OuteTTS models.
☆406 Updated 2 weeks ago
revdotcom / reverb
Open source inference code for Rev's model
☆333 Updated this week
nyrahealth / CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
☆262 Updated 2 months ago
akashmjn / tinydiarize
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
☆444 Updated last year
CerebriumAI / examples
☆446 Updated this week
gaborvecsei / whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
☆155 Updated 5 months ago