fedirz / faster-whisper-server
☆384Updated this week
Related projects:
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆475Updated last month
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.☆308Updated 3 weeks ago
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆650Updated 2 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆1,770Updated 2 weeks ago
- Whisper with Medusa heads☆774Updated last week
- Local SRT/LLM/TTS Voicechat☆471Updated last month
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆341Updated this week
- A nearly-live implementation of OpenAI's Whisper.☆1,798Updated 2 weeks ago
- ☆278Updated 2 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines☆276Updated 3 weeks ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆147Updated 3 weeks ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆308Updated 3 months ago
- ☆1,079Updated 2 months ago
- Pybind11 bindings for Whisper.cpp☆321Updated this week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.☆1,509Updated last month
- A simple FastAPI Server to run XTTSv2☆357Updated last month
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆188Updated last month
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆262Updated 7 months ago
- A python package to build AI-powered real-time audio applications☆992Updated 2 months ago
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod…☆299Updated last week
- Command Your World with Voice☆368Updated 3 weeks ago
- Live-Transcription (STT) with Whisper PoC☆140Updated 3 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆97Updated 7 months ago
- Build real-time multimodal AI applications 🤖🎙️📹☆1,053Updated this week
- Converts text to speech in realtime☆1,730Updated 3 weeks ago
- A talking LLM that runs on your own computer without needing the internet.☆222Updated last month
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆134Updated 3 weeks ago
- Plug Whisper audio transcription into a local ollama server and output TTS audio responses☆196Updated 10 months ago
- Python bindings for whisper.cpp☆150Updated this week
- Suno AI's Bark model in C/C++ for fast text-to-speech☆684Updated 2 months ago