Real-Time Whisper Voice Recognition with vosk model feedback.
☆120Jun 30, 2023Updated 2 years ago
Alternatives and similar repositories for vosper
Users that are interested in vosper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/☆832Sep 12, 2025Updated 6 months ago
- Streaming transcriber with whisper☆694May 1, 2023Updated 2 years ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆360Jul 20, 2025Updated 8 months ago
- A quick experiment to achieve almost realtime transcription using Whisper.☆186Sep 22, 2022Updated 3 years ago
- plugin manager for OpenVoiceOS , STT/TTS/Wakewords that can be used anywhere☆13Updated this week
- ☆18Apr 28, 2021Updated 4 years ago
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in☆784Apr 30, 2024Updated last year
- Real time transcription with OpenAI Whisper.☆2,914Apr 15, 2025Updated 11 months ago
- Real time speech to text transcription app.☆435Jan 14, 2023Updated 3 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆90Aug 28, 2023Updated 2 years ago
- Thin wrapper around OpenAI Whisper API with streaming support☆86Dec 5, 2025Updated 3 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆12Aug 1, 2025Updated 7 months ago
- Jupyter notebooks for PuLID face transfer with Flux.1 dev. Able to run on Google Colab Free Tier☆18Dec 18, 2024Updated last year
- Voice memos recorded from the microphone, transcribed offline to text and converted to Joplin notes☆29Mar 1, 2024Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Dec 21, 2023Updated 2 years ago
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- Python library to write, read, and verify transparency metadata in audio files for AI transparency compliance.☆19Aug 17, 2025Updated 7 months ago
- A nearly-live implementation of OpenAI's Whisper.☆3,893Mar 17, 2026Updated last week
- Speaker diarization service☆28Feb 24, 2026Updated last month
- A tiny jsx compiler☆14Dec 5, 2017Updated 8 years ago
- Keyword Spotting for detecting a word in an audio file☆17Jul 21, 2019Updated 6 years ago
- An interface for llama.cpp, ChatGPT, Gemini, and Claude☆27Mar 9, 2026Updated 2 weeks ago
- ☆15Aug 25, 2022Updated 3 years ago
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆289Dec 30, 2025Updated 2 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆70Dec 30, 2023Updated 2 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API's☆14Jun 24, 2023Updated 2 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 9 months ago
- Send SMS Text Messages within the Ethereum Blockchain - TextMessage.ETH☆16Aug 24, 2017Updated 8 years ago
- A collection of all our phonemeizers for dataset construction and inference☆28Feb 21, 2025Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Feb 4, 2023Updated 3 years ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆31Mar 6, 2025Updated last year
- ☆27Nov 3, 2025Updated 4 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated last year
- Live transcription with OpenAi Whisper☆50Nov 11, 2022Updated 3 years ago
- Simple diarization model☆53Jun 13, 2025Updated 9 months ago
- Provides an interface for extensions to use language models directly in the browser.☆16Mar 3, 2026Updated 3 weeks ago