gkorepanov / whisper-stream
Thin wrapper around OpenAI Whisper API with streaming support
☆89Updated 7 months ago
Alternatives and similar repositories for whisper-stream
Users that are interested in whisper-stream are comparing it to the libraries listed below
- streaming speech to text server using Whisper☆94Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated 11 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆249Updated 2 years ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- ☆55Updated last week
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- Fine-tuning of the OpenAI Whisper base model for Russian on the Sber-golos dataset☆40Updated 2 years ago
- ☆157Updated 2 years ago
- HuggingChat like UI in Gradio☆71Updated 2 years ago
- Tools and agents for automated research.☆35Updated this week
- T5-based (russian) text normalization☆22Updated last year
- This is a phonemic multilingual (Russian-English) Implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-S…☆51Updated 5 years ago
- Text To Speech Synthesis with Vosk☆210Updated last week
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Coqui AI TTS plugin☆85Updated 2 months ago
- Lyrics generation with GPT2-based Transformer☆106Updated 3 years ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- ASR + diarization model server with speculative decoding☆63Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆129Updated 2 years ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆136Updated last year
- CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search☆64Updated last month
- A Hugging Face pipeline to train a GPT model based on the transcript obtained by the OpenAI Whisper model☆15Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆30Updated last year
- Simple IPA phonemizer based on ruaccent-encoder☆24Updated 4 months ago
- faster-whisper as serverless endpoint☆115Updated 3 months ago
- TTS with The Massively Multilingual Speech (MMS) project☆235Updated last year
- DeepPavlov Dream is a free, open-source Multiskill AI Assistant Platform built using DeepPavlov Conversational AI Stack. It is built on t…☆174Updated 7 months ago
- openvino version of openai/whisper☆174Updated last year