tableos / mina
An experiment in using whisper.cpp for real-time speech-to-text
☆20 Updated 2 years ago
Alternatives and similar repositories for mina:
Users who are interested in mina compare it with the libraries listed below.
- Streaming speech-to-text server using Whisper ☆91 Updated last year
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion. ☆68 Updated last year
- ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆208 Updated 5 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated 11 months ago
- whisper.cpp bindings for Python ☆94 Updated last year
- ☆156 Updated last year
- Live-Transcription (STT) with Whisper PoC ☆181 Updated 10 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆118 Updated last year
- ☆36 Updated 2 years ago
- A quick experiment to achieve almost realtime transcription using Whisper. ☆188 Updated 2 years ago
- A curated list of awesome OpenAI's Whisper ☆101 Updated last year
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files. ☆47 Updated 8 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆112 Updated last year
- Generate granular word-level captions in SRT format ☆57 Updated 2 years ago
- Transcription with speaker diarization pipeline ☆92 Updated last year
- Just an .exe that can be used by those unable to build whisper.cpp on Windows. ☆42 Updated 2 years ago
- Smart Whisper is a native Node.js addon designed for efficient and streamlined interaction with whisper.cpp, with automatic model off… ☆50 Updated last week
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a Next.js app ☆203 Updated 10 months ago
- React Native Expo wrapper for the Swift WhisperKit library ☆12 Updated last week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ☆349 Updated 10 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆113 Updated last year
- faster-whisper as serverless endpoint ☆96 Updated last week
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project ☆52 Updated last year
- web based editor for subtitles and transcripts ☆130 Updated 8 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups. ☆155 Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆206 Updated last week
- ez audio transcription tool with flexible processing and post-processing options ☆149 Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ☆47 Updated last year
- Pybind11 bindings for Whisper.cpp ☆328 Updated 4 months ago
- Web Browser Audio Detection/Speech Recording Events API ☆74 Updated 2 years ago