Digipom / WhisperCppAndroidDemoLinks
A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.
☆63Updated 2 years ago
Alternatives and similar repositories for WhisperCppAndroidDemo
Users that are interested in WhisperCppAndroidDemo are comparing it to the libraries listed below
Sorting:
- Robust Speech Recognition via Large-Scale Weak Supervision☆88Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆62Updated 2 years ago
- openvino version of openai/whisper☆178Updated 2 years ago
- Offline voice input panel & keyboard with punctuation for Android.☆108Updated last year
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆267Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆67Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- A voice to text keyboard based on OpenAI Whisper Model.☆49Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- C++ library for converting text to phonemes for Piper☆137Updated 5 months ago
- Open models for Coqui STT☆148Updated 2 years ago
- Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android☆588Updated 10 months ago
- On-device noise suppression powered by deep learning☆77Updated last week
- Zero-shot Audio Classification using Whisper☆79Updated 3 years ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆256Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆99Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆193Updated last year
- Port of Meta's Encodec in C/C++☆227Updated last year
- Using OpenVINO to speed up MeloTTS inference☆15Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆238Updated this week
- How to create your own model for vosk☆74Updated 4 years ago
- Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.☆446Updated 5 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Voice models for Mimic 3 text to speech system☆160Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆27Updated 3 months ago
- ONNX Inference of Pyannote Segmentation☆97Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆134Updated last year