tableos / mina
An experiment of trying out whisper.cpp for real-time speech-to-text
☆20Updated 2 years ago
Alternatives and similar repositories for mina:
Users that are interested in mina are comparing it to the libraries listed below
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆47Updated 8 months ago
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.☆67Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- whisper.cpp bindings for python☆94Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆207Updated 5 months ago
- streaming speech to text server using Whisper☆89Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆142Updated 10 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- web based editor for subtitles and transcripts☆128Updated 7 months ago
- On-device voice activity detection (VAD) powered by deep learning☆204Updated this week
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆483Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆46Updated this week
- Transcription with speaker diarization pipeline☆91Updated last year
- Speech Diarization for scrum automation☆102Updated last year
- Batch Support for OpenAI Whisper☆92Updated last year
- Live-Transcription (STT) with Whisper PoC☆175Updated 9 months ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆239Updated 2 years ago
- An even smaller speech recognizer / force aligner☆32Updated 3 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆62Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- Web browser version of StarCoder.cpp☆44Updated last year
- Pybind11 bindings for Whisper.cpp☆328Updated 3 months ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆135Updated 8 months ago
- generate granular word-level captions in srt format☆57Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options☆148Updated last year