Digipom / WhisperCppAndroidDemo
A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.
☆64 Updated last year
Related projects
Alternatives and complementary repositories for WhisperCppAndroidDemo
- Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android ☆229 Updated this week
- A voice to text keyboard based on OpenAI Whisper Model. ☆48 Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework ☆57 Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆53 Updated 10 months ago
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices ☆174 Updated 2 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆105 Updated last year
- openvino version of openai/whisper ☆161 Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision ☆70 Updated last year
- A ggml (C++) re-implementation of tortoise-tts ☆155 Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆83 Updated 6 months ago
- C++ version of pyannote audio speaker diarization pipeline ☆18 Updated 8 months ago
- Offline voice input panel & keyboard with punctuation for Android. ☆89 Updated 5 months ago
- C++ library for converting text to phonemes for Piper ☆88 Updated 7 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles ☆74 Updated last year
- ONNX Inference of Pyannote Segmentation ☆65 Updated last month
- ez audio transcription tool with flexible processing and post-processing options ☆128 Updated 9 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆274 Updated 9 months ago
- streaming speech to text server using Whisper ☆83 Updated last year
- Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models. ☆253 Updated 8 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆114 Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆66 Updated 6 months ago
- ☆97 Updated 4 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project ☆52 Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆173 Updated 2 weeks ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ☆196 Updated last week
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation ☆40 Updated last year
- Open models for Coqui STT ☆122 Updated last year
- ☆294 Updated 4 months ago
- FastAPI service on top of WhisperX ☆40 Updated 7 months ago
- web based editor for subtitles and transcripts ☆110 Updated 2 months ago