Whisper realtime streaming for long speech-to-text transcription and translation
☆122Jan 29, 2024Updated 2 years ago
Alternatives and similar repositories for whisper_streaming
Users that are interested in whisper_streaming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,611Nov 12, 2025Updated 5 months ago
- A streaming whisper server for on-prem transcription☆23Aug 15, 2024Updated last year
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆31Jun 18, 2025Updated 10 months ago
- High accuracy code-switching whisper / qwen3 transcription☆28Apr 20, 2026Updated 2 weeks ago
- A system for live lecture translation (speech to text) where the audience can easily provide corrections.☆14Aug 4, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MLX binary vectors and associated algorithms.☆14Mar 13, 2025Updated last year
- Build a Conversational AI System that can answer questions by retrieving the answers from a document.☆11Feb 23, 2024Updated 2 years ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆354Nov 13, 2024Updated last year
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆25Apr 20, 2026Updated 2 weeks ago
- an example of local large langue model embeddings with Redis as vector stores☆12Apr 30, 2023Updated 3 years ago
- ☆158Jun 26, 2023Updated 2 years ago
- python wrapper for kaldi's native I/O☆27Jan 9, 2025Updated last year
- ☆29Nov 28, 2025Updated 5 months ago
- ☆17Sep 30, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Realtime voice-enabled AI assistant that can engage in natural conversations☆24Nov 23, 2025Updated 5 months ago
- ☆12,736Oct 25, 2025Updated 6 months ago
- A nearly-live implementation of OpenAI's Whisper.☆4,001Apr 21, 2026Updated 2 weeks ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆942Jun 3, 2025Updated 11 months ago
- ☆12Apr 10, 2019Updated 7 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Jun 18, 2023Updated 2 years ago
- ☆206May 27, 2024Updated last year
- ☆10Oct 27, 2024Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆568Aug 27, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- a simple transcription library for rust☆190Apr 8, 2026Updated 3 weeks ago
- Declarative javascript library for RethinkDB supporting Atomic transactions☆12Oct 21, 2020Updated 5 years ago
- ☆19Jan 9, 2024Updated 2 years ago
- WhisperPlus: Faster, Smarter, and More Capable 🚀☆1,950Mar 2, 2026Updated 2 months ago
- Local realtime voice AI☆2,484Nov 26, 2025Updated 5 months ago
- Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification☆13Aug 13, 2018Updated 7 years ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆401Jun 8, 2024Updated last year
- ☆13Apr 28, 2025Updated last year
- Live-Transcription (STT) with Whisper PoC☆200Jun 18, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 🐚 CLI tool for working with Firefly Zero: build, publish, and install games, control device, etc.☆15Apr 24, 2026Updated last week
- PersonaPlex on Apple Silicon: an MLX port of NVIDIA’s full-duplex speech-to-speech model with realtime local/web modes and offline WAV in…☆65Feb 18, 2026Updated 2 months ago
- streaming speech to text server using Whisper☆102Jun 2, 2023Updated 2 years ago
- QnA bot on a CSV☆21May 23, 2023Updated 2 years ago
- ☆22Dec 19, 2025Updated 4 months ago
- Set up YOLOv8 on Jetson nano with Jetpack 4.6☆14Jan 2, 2024Updated 2 years ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆190Jun 8, 2023Updated 2 years ago