Digipom / WhisperCppAndroidDemo
A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.
☆64 Updated last year
Alternatives and similar repositories for WhisperCppAndroidDemo
Users who are interested in WhisperCppAndroidDemo are comparing it to the libraries listed below.
- Robust Speech Recognition via Large-Scale Weak Supervision ☆84 Updated last year
- Offline voice input panel & keyboard with punctuation for Android. ☆106 Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework ☆60 Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆64 Updated last year
- A voice to text keyboard based on OpenAI Whisper Model. ☆50 Updated 2 years ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation ☆43 Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆117 Updated 2 years ago
- openvino version of openai/whisper ☆170 Updated last year
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices ☆250 Updated 11 months ago
- C++ library for converting text to phonemes for Piper ☆128 Updated 3 weeks ago
- Open models for Coqui STT ☆142 Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles ☆79 Updated 2 years ago
- Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android ☆498 Updated 5 months ago
- On-device noise suppression powered by deep learning ☆73 Updated 3 weeks ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆119 Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port… ☆23 Updated last week
- Zero-shot Audio Classification using Whisper ☆79 Updated 2 years ago
- A ggml (C++) re-implementation of tortoise-tts ☆188 Updated 11 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆96 Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆161 Updated last year
- Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models. ☆390 Updated 3 weeks ago
- On-device voice activity detection (VAD) powered by deep learning ☆222 Updated 2 weeks ago
- Batch Support for OpenAI Whisper ☆94 Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment ☆247 Updated 2 years ago
- C++ version of pyannote audio speaker diarization pipeline ☆21 Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ☆28 Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆334 Updated 8 months ago
- Port of Meta's Encodec in C/C++ ☆225 Updated 8 months ago
- streaming speech to text server using Whisper ☆94 Updated 2 years ago