Whisper realtime streaming for long speech-to-text transcription and translation
☆122Jan 29, 2024Updated 2 years ago
Alternatives and similar repositories for whisper_streaming
Users that are interested in whisper_streaming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,594Nov 12, 2025Updated 5 months ago
- A streaming whisper server for on-prem transcription☆23Aug 15, 2024Updated last year
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆31Jun 18, 2025Updated 9 months ago
- High accuracy code-switching whisper / qwen3 transcription☆25Updated this week
- A system for live lecture translation (speech to text) where the audience can easily provide corrections.☆14Aug 4, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- MLX binary vectors and associated algorithms.☆14Mar 13, 2025Updated last year
- An API to transcribe audio with OpenAI's Whisper Large v3!☆348Nov 13, 2024Updated last year
- ☆158Jun 26, 2023Updated 2 years ago
- python wrapper for kaldi's native I/O☆27Jan 9, 2025Updated last year
- A simple app to convert audio files to text using speech-to-text APIs☆11Dec 8, 2022Updated 3 years ago
- Projeto para rodar o asterisk em docker e docker-compose.☆11Jun 6, 2020Updated 5 years ago
- Realtime voice-enabled AI assistant that can engage in natural conversations☆24Nov 23, 2025Updated 4 months ago
- ☆12,423Oct 25, 2025Updated 5 months ago
- A nearly-live implementation of OpenAI's Whisper.☆3,948Mar 17, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆936Jun 3, 2025Updated 10 months ago
- The Asterisk Documentation Project.☆53Updated this week
- ☆206May 27, 2024Updated last year
- The best way to practice interview questions☆14Apr 25, 2023Updated 2 years ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆564Aug 27, 2024Updated last year
- Python script to generate a PDF report based on sentiment analysis, words usage, personality insights, tone analysis and facial expressio…☆12Aug 1, 2021Updated 4 years ago
- Local realtime voice AI☆2,479Nov 26, 2025Updated 4 months ago
- Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification☆13Aug 13, 2018Updated 7 years ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆396Jun 8, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A python package to build AI-powered real-time audio applications☆1,966Feb 12, 2025Updated last year
- Discord integration for the oobabooga's text-generation-webui☆13Apr 27, 2023Updated 2 years ago
- ☆13Apr 28, 2025Updated 11 months ago
- Live-Transcription (STT) with Whisper PoC☆200Jun 18, 2024Updated last year
- Seamlessly manage your Alexa shopping list. Add, remove, and view items instantly. Interact with your Alexa shopping list via MCP, using …☆17Apr 14, 2025Updated last year
- PersonaPlex on Apple Silicon: an MLX port of NVIDIA’s full-duplex speech-to-speech model with realtime local/web modes and offline WAV in…☆60Feb 18, 2026Updated last month
- This plugin allows you to create and edit files in a directory on your computer using ChatGPT☆48Jul 10, 2023Updated 2 years ago
- streaming speech to text server using Whisper☆102Jun 2, 2023Updated 2 years ago
- QnA bot on a CSV☆21May 23, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Set up YOLOv8 on Jetson nano with Jetpack 4.6☆14Jan 2, 2024Updated 2 years ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆187Jun 8, 2023Updated 2 years ago
- Do Multilingual Language Models Think Better in English?☆42Aug 3, 2023Updated 2 years ago
- A repository of Japanese Phoneme-Level BERT☆24Dec 16, 2023Updated 2 years ago
- Text analyzer that extracts tokens from text for use in full-text search queries and indexes.☆12Nov 25, 2022Updated 3 years ago
- Faster Whisper transcription with CTranslate2☆22,041Nov 19, 2025Updated 4 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,741Mar 26, 2026Updated 2 weeks ago