alesaccoia / VoiceStreamAILinks

Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS

☆898

Alternatives and similar repositories for VoiceStreamAI

Users that are interested in VoiceStreamAI are comparing it to the libraries listed below

Sorting:

ufal / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆3,165Updated last month
collabora / WhisperLive
A nearly-live implementation of OpenAI's Whisper.
☆3,190Updated last week
juanmc2005 / diart
A python package to build AI-powered real-time audio applications
☆1,387Updated 5 months ago
gaborvecsei / whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
☆189Updated last year
QuentinFuxa / WhisperLiveKit
Python package for Real-time, Local Speech-to-Text and Speaker Diarization. FastAPI Server & Web Interface
☆405Updated 2 weeks ago
yinruiqing / pyannote-whisper
☆614Updated last year
collabora / WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
☆1,620Updated last year
nyrahealth / CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
☆786Updated 2 months ago
davabase / whisper_real_time
Real time transcription with OpenAI Whisper.
☆2,794Updated 3 months ago
speaches-ai / speaches
☆2,151Updated this week
davabase / transcriber_app
Real time speech to text transcription app.
☆420Updated 2 years ago
lhl / voicechat2
Local SRT/LLM/TTS Voicechat
☆702Updated 9 months ago
KoljaB / RealtimeTTS
Converts text to speech in realtime
☆3,338Updated last week
saharmor / whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
☆816Updated last year
KoljaB / Linguflex
Command Your World with Voice
☆737Updated last month
ricky0123 / vad
Voice activity detector (VAD) for the browser with a simple API
☆1,494Updated last week
shashikg / WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
☆445Updated 11 months ago
linto-ai / whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
☆2,530Updated 4 months ago
ochen1 / insanely-fast-whisper-cli
The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️
☆364Updated last year
KoljaB / LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…
☆656Updated last month
Vaibhavs10 / open-tts-tracker
☆1,131Updated 5 months ago
chengsokdara / use-whisper
React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in
☆773Updated last year
aiola-lab / whisper-medusa
Whisper with Medusa heads
☆850Updated 3 weeks ago
jianfch / stable-ts
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
☆1,955Updated 2 months ago
EtienneAb3d / WhisperHallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
☆333Updated 8 months ago
thomasmol / cog-whisper-diarization
Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote
☆217Updated 5 months ago
jim60105 / docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …
☆336Updated 2 weeks ago
pavelzbornik / whisperX-FastAPI
FastAPI service on top of WhisperX
☆120Updated this week
JigsawStack / insanely-fast-whisper-api
An API to transcribe audio with OpenAI's Whisper Large v3!
☆296Updated 8 months ago
NavodPeiris / speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…
☆222Updated 3 months ago