rpdrewes / whisper-websocket-server
Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.
☆51Updated 8 months ago
Related projects: ⓘ
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆81Updated 4 months ago
- web based editor for subtitles and transcripts☆102Updated last month
- Real-Time Whisper Voice Recognition with vosk model feedback.☆103Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆29Updated last month
- streaming speech to text server using Whisper☆75Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆122Updated 7 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆56Updated 4 months ago
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆63Updated last month
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆62Updated last year
- An API to transcribe audio with OpenAI's Whisper Large v3!☆166Updated 3 weeks ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆97Updated 7 months ago
- Offline voice input panel & keyboard with punctuation for Android.☆84Updated 3 months ago
- ☆62Updated 4 months ago
- Open models for Coqui STT☆119Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆43Updated last week
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆12Updated last month
- a transcription application that listens to audio input from the microphone using OpenAI's Whisper, transcribes it into text, and simulat…☆13Updated 6 months ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆44Updated 10 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆119Updated 2 months ago
- An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and input the recognized text; Supports English, Chinese, …☆34Updated last month
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆41Updated last year
- Talk with ChatGPT using your VOICE☆120Updated this week
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆151Updated last week
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆134Updated 3 weeks ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆63Updated last year
- ☆74Updated 2 months ago
- Python bindings for whisper.cpp☆150Updated this week
- Talk to GPT-4 and create a story together.☆78Updated 9 months ago
- ☆66Updated 6 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆51Updated last year