Real-Time Whisper Voice Recognition with vosk model feedback.
☆121Jun 30, 2023Updated 2 years ago
Alternatives and similar repositories for vosper
Users that are interested in vosper are comparing it to the libraries listed below
Sorting:
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆833Sep 12, 2025Updated 5 months ago
- Streaming transcriber with whisper☆695May 1, 2023Updated 2 years ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆359Jul 20, 2025Updated 7 months ago
- A quick experiment to achieve almost realtime transcription using Whisper.☆186Sep 22, 2022Updated 3 years ago
- Thin wrapper around OpenAI Whisper API with streaming support☆86Dec 5, 2025Updated 2 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆12Aug 1, 2025Updated 7 months ago
- Speaker diarization service☆27Feb 24, 2026Updated last week
- plugin manager for OpenVoiceOS , STT/TTS/Wakewords that can be used anywhere☆13Jan 30, 2026Updated last month
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated 11 months ago
- Modern web scraper with LLM-enhanced extraction, extensible pipeline, and pluggable parsers.☆10Updated this week
- ☆26Nov 3, 2025Updated 4 months ago
- Voice memos recorded from the microphone, transcribed offline to text and converted to Joplin notes☆29Mar 1, 2024Updated 2 years ago
- Real time transcription with OpenAI Whisper.☆2,914Apr 15, 2025Updated 10 months ago
- Real time speech to text transcription app.☆434Jan 14, 2023Updated 3 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in☆786Apr 30, 2024Updated last year
- ☆11Aug 24, 2022Updated 3 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆89Aug 28, 2023Updated 2 years ago
- An interface for llama.cpp, ChatGPT, Gemini, and Claude☆27Jan 29, 2026Updated last month
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆31Jun 17, 2024Updated last year
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆30Mar 6, 2025Updated 11 months ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API's☆14Jun 24, 2023Updated 2 years ago
- tkinter desktop chat interface with OpenAI's gpt-3.5-turbo API☆12Apr 29, 2023Updated 2 years ago
- ☆17Apr 28, 2021Updated 4 years ago
- ☆13Mar 12, 2024Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 9 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Apr 6, 2025Updated 10 months ago
- Llama.cpp-qt is a Python-based GUI wrapper for the LLama.cpp server, providing a user-friendly interface for configuring and running the …☆16Oct 4, 2023Updated 2 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Nov 25, 2024Updated last year
- Extract structured data from PDF invoices☆14Mar 16, 2021Updated 4 years ago
- Python клиент API распознавания и синтеза речи Облака ЦРТ☆11Dec 26, 2022Updated 3 years ago
- ☆32Oct 23, 2025Updated 4 months ago
- ☆15Aug 25, 2022Updated 3 years ago
- A real time offline transcriber with gui, based on OpenAI whisper☆16Dec 25, 2025Updated 2 months ago
- An AudioServer that takes audio from Asterisk via UDP and sends it to Google's Speech To Text Engine☆33Jan 6, 2023Updated 3 years ago
- streaming speech to text server using Whisper☆101Jun 2, 2023Updated 2 years ago
- Russian accentuator and IPA transcriber☆16Sep 10, 2024Updated last year