tableos / mina
An experiment of trying out whisper.cpp for real-time speech-to-text
☆20Updated last year
Related projects ⓘ
Alternatives and complementary repositories for mina
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆83Updated 6 months ago
- openvino version of openai/whisper☆161Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆155Updated 2 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆44Updated 3 months ago
- streaming speech to text server using Whisper☆83Updated last year
- SemanticFinder - frontend-only live semantic search with transformers.js☆228Updated 2 months ago
- whisper.cpp bindings for python☆76Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆128Updated 9 months ago
- Live-Transcription (STT) with Whisper PoC☆152Updated 4 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆105Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆196Updated last week
- An API to transcribe audio with OpenAI's Whisper Large v3!☆184Updated 2 months ago
- Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language det…☆192Updated 3 weeks ago
- Port of Meta's Encodec in C/C++☆199Updated 2 weeks ago
- web based editor for subtitles and transcripts☆110Updated 2 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆57Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆274Updated 9 months ago
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆176Updated 3 weeks ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆53Updated 10 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆101Updated 9 months ago
- ☆97Updated 4 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆152Updated last month
- A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenA…☆168Updated 5 months ago
- Pybind11 bindings for Whisper.cpp☆324Updated this week
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated last year
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆187Updated 5 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆122Updated last year
- Speaker Diarization with Transformers☆59Updated 5 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year