nalbion / whisper-serverView external linksLinks
streaming speech to text server using Whisper
☆101Jun 2, 2023Updated 2 years ago
Alternatives and similar repositories for whisper-server
Users that are interested in whisper-server are comparing it to the libraries listed below
Sorting:
- Thin wrapper around OpenAI Whisper API with streaming support☆86Dec 5, 2025Updated 2 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆3,530Nov 12, 2025Updated 3 months ago
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆833Sep 12, 2025Updated 5 months ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 3 years ago
- A simple, accessible and offline real-time transcription app for Android.☆14Oct 1, 2024Updated last year
- ☆18Feb 4, 2026Updated last week
- ☆11Sep 5, 2025Updated 5 months ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Oct 10, 2023Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- ai-validator is a powerful library that helps to extract and validate structured data from the output text of language models.☆16May 23, 2023Updated 2 years ago
- General purpose synchronized bit sampler for esp8266☆13Feb 28, 2015Updated 10 years ago
- gRPC server for hnswlib☆16Mar 6, 2023Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆32Jul 28, 2024Updated last year
- ☆30Jun 12, 2025Updated 8 months ago
- A nearly-live implementation of OpenAI's Whisper.☆3,803Updated this week
- javascript implementation of nmap (Neighborhood Preservation Space-filling Algorithm)☆18Sep 25, 2015Updated 10 years ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)☆18Feb 29, 2024Updated last year
- ☆40May 4, 2024Updated last year
- A Streaming-Native Serving Engine for TTS/STS Models☆48Updated this week
- ☆21May 24, 2023Updated 2 years ago
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆51Updated this week
- Shared Voice Interface☆44Oct 21, 2023Updated 2 years ago
- ☆2,935Updated this week
- Use mark to run lots of prompts on lots of data☆18Aug 12, 2025Updated 6 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Jun 30, 2023Updated 2 years ago
- ☆88Feb 13, 2025Updated last year
- OpenAI Whisper API-style local server, runnig on FastAPI☆87Oct 8, 2025Updated 4 months ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆36Dec 24, 2025Updated last month
- ☆18Sep 19, 2023Updated 2 years ago
- ☆32Aug 22, 2024Updated last year
- React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in☆786Apr 30, 2024Updated last year
- Open Source Python SDK for AI Agents Identity☆33Jan 20, 2026Updated 3 weeks ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆22Jun 5, 2025Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100May 7, 2024Updated last year
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa…☆24May 14, 2021Updated 4 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆162Feb 1, 2024Updated 2 years ago
- Example: Micro speech for TensorFlow Lite☆34Dec 18, 2023Updated 2 years ago