gkorepanov / whisper-streamLinks
Thin wrapper around OpenAI Whisper API with streaming support
☆89Updated 5 months ago
Alternatives and similar repositories for whisper-stream
Users that are interested in whisper-stream are comparing it to the libraries listed below
Sorting:
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated 10 months ago
- CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search☆65Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆246Updated 2 years ago
- streaming speech to text server using Whisper☆93Updated 2 years ago
- ☆51Updated 2 weeks ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- Fine tuning of the base model from OpenAI Whisper in Russian language on the dataset Sber-golos☆39Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆116Updated 2 years ago
- T5-based (russian) text normalization☆21Updated last year
- Lyrics generation with GPT2-based Transformer☆105Updated 3 years ago
- ☆158Updated 2 years ago
- A curated list of awesome OpenAI's Whisper☆101Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 6 months ago
- Text To Speech Synthesis with Vosk☆197Updated this week
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆51Updated 2 years ago
- Coqui AI TTS plugin☆80Updated 2 weeks ago
- Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM☆20Updated 11 months ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆101Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- ☆17Updated 3 years ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆127Updated 2 years ago
- Tools and agents for automated research.☆30Updated 2 weeks ago
- Framework for processing and filtering datasets☆27Updated 11 months ago
- ☆33Updated 2 years ago
- ASR + diarization model server with speculative decoding☆61Updated last year
- Booster - open accelerator for LLM models. Better inference and debugging for AI hackers☆158Updated 11 months ago
- Простой IPA фонемизатор на базе ruaccent-encoder☆21Updated 3 months ago
- Production-ready audio and video transcription app that can run on your laptop or in the cloud.☆72Updated last year
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- ☆31Updated 9 months ago