gkorepanov / whisper-streamLinks
Thin wrapper around OpenAI Whisper API with streaming support
☆89Updated 8 months ago
Alternatives and similar repositories for whisper-stream
Users that are interested in whisper-stream are comparing it to the libraries listed below
Sorting:
- streaming speech to text server using Whisper☆94Updated 2 years ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆249Updated 2 years ago
- Tools and agents for automated research.☆37Updated this week
- Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago
- Text To Speech Synthesis with Vosk☆212Updated last month
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- Framework for processing and filtering datasets☆27Updated last year
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆60Updated 2 years ago
- ☆33Updated 2 years ago
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 9 months ago
- CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search☆65Updated last month
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆45Updated 6 months ago
- T5-based (russian) text normalization☆22Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- Scripts and stuff☆18Updated 2 years ago
- Lyrics generation with GPT2-based Transformer☆106Updated 3 years ago
- ☆158Updated 2 years ago
- ☆48Updated 2 months ago
- TTS with The Massively Multilingual Speech (MMS) project☆234Updated last year
- Top ML papers of the week.☆40Updated this week
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆129Updated 2 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated 11 months ago
- Transcription with speaker diarization pipeline☆94Updated 2 years ago
- Create amazing Stable Diffusion prompts with minimal prompt knowledge. A vicuna based prompt engineering tool for stable diffusion☆91Updated 2 years ago
- An OpenAI-like LLaMA inference API☆113Updated 2 years ago
- Простой IPA фонемизатор на базе ruaccent-encoder☆24Updated 5 months ago
- ☆175Updated last year