gkorepanov / whisper-stream
Thin wrapper around OpenAI Whisper API with streaming support
☆89Updated 10 months ago
Alternatives and similar repositories for whisper-stream
Users who are interested in whisper-stream are comparing it to the libraries listed below.
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- streaming speech to text server using Whisper☆98Updated 2 years ago
- T5-based (russian) text normalization☆24Updated last year
- Text To Speech Synthesis with Vosk☆228Updated last week
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated last year
- ☆158Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆98Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆256Updated 3 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated 2 years ago
- Tools and agents for automated research.☆46Updated last month
- Transcription with speaker diarization pipeline☆97Updated 2 years ago
- A simple IPA phonemizer based on ruaccent-encoder☆24Updated 7 months ago
- A JS web client for connecting to Pipecat bots with voice and vision☆44Updated 11 months ago
- Fine-tuning of the OpenAI Whisper base model for Russian on the Sber-golos dataset☆41Updated 3 years ago
- This is a phonemic multilingual (Russian-English) Implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-S…☆52Updated 5 years ago
- A curated list of awesome OpenAI's Whisper☆99Updated 2 years ago
- ☆60Updated 2 weeks ago
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆230Updated 9 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆217Updated 3 months ago
- Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago
- ASR + diarization model server with speculative decoding☆63Updated last year
- DeepPavlov Dream is a free, open-source Multiskill AI Assistant Platform built using DeepPavlov Conversational AI Stack. It is built on t…☆177Updated 10 months ago
- faster-whisper as serverless endpoint☆125Updated 2 weeks ago
- Lyrics generation with GPT2-based Transformer☆108Updated 3 years ago
- openvino version of openai/whisper☆178Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Coqui AI TTS plugin☆87Updated 5 months ago
- Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM☆22Updated last year
- TTS with The Massively Multilingual Speech (MMS) project☆231Updated last year