Live-Transcription (STT) with Whisper PoC
☆201Jun 18, 2024Updated last year
Alternatives and similar repositories for whisper-live-transcription
Users that are interested in whisper-live-transcription are comparing it to the libraries listed below
Sorting:
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆949Oct 2, 2024Updated last year
- A nearly-live implementation of OpenAI's Whisper.☆3,850Feb 20, 2026Updated 2 weeks ago
- Getting VibeVoice 7b working with 10 gb of vram.☆14Aug 31, 2025Updated 6 months ago
- Real time transcription with OpenAI Whisper.☆2,913Apr 15, 2025Updated 10 months ago
- Add geo functionality extension to datafusion query engine.☆11Apr 26, 2024Updated last year
- Auto-Video maker handling many AI's☆11Mar 18, 2024Updated last year
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization☆16May 14, 2025Updated 9 months ago
- network pinger with UI☆14Feb 12, 2024Updated 2 years ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Feb 5, 2024Updated 2 years ago
- Transcribe is a real time transcription, conversation, Language learning platform. It provides live transcripts from microphone and speak…☆250Feb 11, 2026Updated 3 weeks ago
- Real-time transcription using faster-whisper☆613Jul 23, 2024Updated last year
- A simple dify bot☆34Apr 16, 2025Updated 10 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆183Jun 8, 2023Updated 2 years ago
- ☆32May 22, 2024Updated last year
- ☆15Sep 21, 2022Updated 3 years ago
- 我从动漫中学习到的知识和人生感悟☆16Mar 6, 2025Updated last year
- Record audio or transcribe files using ctranslate2 and whisper!☆177Feb 28, 2026Updated last week
- This repository contains a simple vocoder that works with live input. The vocoder uses LPC coefficients to do voice transformations and/o…☆14Aug 19, 2022Updated 3 years ago
- Query on Everything with SQL☆17Oct 29, 2024Updated last year
- A python package to build AI-powered real-time audio applications☆1,938Feb 12, 2025Updated last year
- 用来管理Dify多个用户空间和用户☆21Mar 12, 2025Updated 11 months ago
- ☆15Apr 3, 2025Updated 11 months ago
- Concept Representation (Embedding) and Semantic Relatedness☆15Jul 3, 2019Updated 6 years ago
- ☆11Updated this week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆388Jun 8, 2024Updated last year
- ☆8,826Oct 25, 2025Updated 4 months ago
- 友善之臂☆24Apr 4, 2024Updated last year
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆922Jun 3, 2025Updated 9 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Jan 29, 2024Updated 2 years ago
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆833Sep 12, 2025Updated 5 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆16Feb 4, 2024Updated 2 years ago
- A command-line tool that uses AI to rename and organize video files.☆20Nov 6, 2025Updated 4 months ago
- LLM inference in C/C++☆21Mar 22, 2025Updated 11 months ago
- Crawl any website with Tavily, embed the content, and deploy the RAG on MongoDB Atlas vector search.☆46Dec 31, 2025Updated 2 months ago
- Faster Whisper transcription with CTranslate2☆21,289Nov 19, 2025Updated 3 months ago
- A transformer-based multimodal model for music.☆29Aug 15, 2024Updated last year
- web based editor for subtitles and transcripts☆144Aug 16, 2024Updated last year
- A JPEG Image Compression Service using Part Homomorphic Encryption.☆31Mar 7, 2025Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆23Sep 26, 2024Updated last year