gkorepanov / whisper-stream
Thin wrapper around OpenAI Whisper API with streaming support
☆86Updated 3 weeks ago
Alternatives and similar repositories for whisper-stream
Users who are interested in whisper-stream are comparing it to the repositories listed below
- Streaming speech-to-text server using Whisper☆98Updated 2 years ago
- Tools and agents for automated research.☆47Updated 3 weeks ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated last year
- Real-time Whisper voice recognition with Vosk model feedback.☆121Updated 2 years ago
- CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search☆66Updated 5 months ago
- Improving transcription performance of OpenAI Whisper for CPU-based deployment☆256Updated 3 years ago
- A JS web client for connecting to Pipecat bots with voice and vision☆44Updated last year
- ☆158Updated 2 years ago
- ☆60Updated last week
- Text To Speech Synthesis with Vosk☆230Updated last month
- T5-based (Russian) text normalization☆24Updated last year
- Demo Python script app to interact with a llama.cpp server using the Whisper API, microphone, and webcam devices.☆46Updated 2 years ago
- A phonemic multilingual (Russian-English) implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-S…☆52Updated 5 years ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆130Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago
- ASR + diarization model server with speculative decoding☆63Updated last year
- A benchmark comparing Russian ChatGPT analogues: Saiga, YandexGPT, Gigachat☆61Updated 2 years ago
- Lyrics generation with GPT2-based Transformer☆108Updated 3 years ago
- Kandinsky 2 — multilingual text2image latent diffusion model☆87Updated last year
- Record a sample of your own voice and let AI narrate the text in your own voice.☆79Updated 2 years ago
- Create amazing Stable Diffusion prompts with minimal prompt knowledge. A Vicuna-based prompt engineering tool for Stable Diffusion☆91Updated 2 years ago
- Run OpenAI Whisper as a Cog model☆67Updated last year
- Command-line script for running inference with models such as MPT-7B-Chat☆100Updated 2 years ago
- A tool for summarizing dialogues from videos or audio☆83Updated 2 years ago
- Kandinsky x Deforum — generating short animations☆105Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- A simple IPA phonemizer based on ruaccent-encoder☆24Updated 8 months ago
- ☆33Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year