gkorepanov / whisper-streamLinks
Thin wrapper around OpenAI Whisper API with streaming support
ā86Updated 3 weeks ago
Alternatives and similar repositories for whisper-stream
Users that are interested in whisper-stream are comparing it to the libraries listed below
Sorting:
- 2D Positional Embeddings for Webpage Structural Understanding š¦šā95Updated last year
- ā158Updated 2 years ago
- Code for OpenAI Whisper Web App Demoā93Updated 3 years ago
- Improving transcription performance of OpenAI Whisper for CPU based deploymentā256Updated 3 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.ā121Updated 2 years ago
- streaming speech to text server using Whisperā98Updated 2 years ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.ā46Updated 2 years ago
- A curated list of awesome OpenAI's Whisperā99Updated 2 years ago
- CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image searchā66Updated 5 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesā100Updated last year
- ā60Updated 2 weeks ago
- Text To Speech Synthesis with Voskā230Updated last month
- faster-whisper as serverless endpointā126Updated last month
- Create amazing Stable Diffusion prompts with minimal prompt knowledge. A vicuna based prompt engineering tool for stable diffusionā91Updated 2 years ago
- TTS with The Massively Multilingual Speech (MMS) projectā231Updated last year
- Fine tuning of the base model from OpenAI Whisper in Russian language on the dataset Sber-golosā41Updated 3 years ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning APIā218Updated 4 months ago
- Whisper realtime streaming for long speech-to-text transcription and translationā121Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannoteā230Updated 10 months ago
- Record a sample of your own voice and let AI narrate the text in your own voice.ā79Updated 2 years ago
- This is a phonemic multilingual (Russian-English) Implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Sā¦ā52Updated 5 years ago
- T5-based (russian) text normalizationā24Updated last year
- Booster - open accelerator for LLM models. Better inference and debugging for AI hackersā167Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chatā100Updated 2 years ago
- ā33Updated 2 years ago
- Coqui AI TTS pluginā85Updated 5 months ago
- Run OpenAI Whisper as a Cog modelā67Updated last year
- An JS web client for connecting to Pipecat bots with voice and visionā44Updated last year
- ā69Updated 8 months ago
- Transcription with speaker diarization pipelineā97Updated 2 years ago