Digipom/WhisperCppAndroidDemo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Digipom/WhisperCppAndroidDemo)

Digipom / WhisperCppAndroidDemo

A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.

☆64

Alternatives and similar repositories for WhisperCppAndroidDemo

Users that are interested in WhisperCppAndroidDemo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

moonshine-ai / openai-whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆91Aug 28, 2023Updated 2 years ago
Vuzix / UltraliteSDK-releases-iOS
View on GitHub
Use this library to connect your iOS, WatchOS, or MacOS app to the Vuzix Z100™ smart glasses.
☆16Mar 18, 2025Updated last year
bjnortier / whisper-tflite-ios
View on GitHub
☆19Nov 4, 2022Updated 3 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
tableos / mina
View on GitHub
An experiment of trying out whisper.cpp for real-time speech-to-text
☆20Dec 25, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
jumon / zac
View on GitHub
Zero-shot Audio Classification using Whisper
☆79Dec 12, 2022Updated 3 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
vanquish630 / BaldGAN
View on GitHub
Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.
☆12Jun 6, 2022Updated 4 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
vilassn / whisper_android
View on GitHub
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
☆679Mar 18, 2026Updated 4 months ago
kaisoapbox / OldWhisperVoiceKeyboard
View on GitHub
A voice to text keyboard based on OpenAI Whisper Model.
☆12Dec 11, 2024Updated last year
yurlovm / VideoThursday
View on GitHub
☆10Oct 3, 2023Updated 2 years ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
michaelneri / unsupervised-audio-anomaly-detection
View on GitHub
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11Nov 6, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Mihaiii / trivia
View on GitHub
A live multiplayer trivia game where users can bid for the subject of the next question
☆29Jan 9, 2026Updated 6 months ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 4 years ago
MrEdwards007 / WhisperTaskAcceleration
View on GitHub
Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization
☆25Oct 29, 2022Updated 3 years ago
TTS-Research / PEL-TTS
View on GitHub
☆14Aug 16, 2023Updated 2 years ago
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 4 months ago
Everyday-Programmer / Android-Camera-using-CameraX
View on GitHub
This repository contains code of Camera App using CameraX library.
☆11Oct 23, 2023Updated 2 years ago
MichaelMcCulloch / WhisperVoiceKeyboard
View on GitHub
A voice to text keyboard based on OpenAI Whisper Model.
☆50Jun 10, 2023Updated 3 years ago
douhaohaode / xtts_v2
View on GitHub
☆72Dec 12, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
skysbird / g2p-zh-en
View on GitHub
Chinese and English Bilinguish G2P
☆22Jul 16, 2023Updated 3 years ago
amazon-science / proteno
View on GitHub
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45May 25, 2021Updated 5 years ago
patyork / AutomaticSpeechChunker
View on GitHub
From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…
☆17May 15, 2015Updated 11 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
lgessler / microbert
View on GitHub
A tiny BERT for low-resource monolingual models
☆32Dec 24, 2025Updated 7 months ago
CaydenPierce / MSA
View on GitHub
Open Source Wearable Microphone Array Glasses for Multi-Speaker Speech Recognition
☆18May 12, 2022Updated 4 years ago
seungheondoh / hi_kia
View on GitHub
wake-up word emotion recognition [APSIPA 2022]
☆17Nov 11, 2022Updated 3 years ago
dafyddg / RFA
View on GitHub
Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…
☆17Apr 27, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
diegohce / gogwave
View on GitHub
Go language bindings for the ggwave C++ library
☆14Apr 9, 2025Updated last year
ryanrudes / YTTTS
View on GitHub
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆53Apr 1, 2021Updated 5 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
JacobLinCool / whisper-cli
View on GitHub
A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.
☆22Jul 17, 2026Updated last week
idiap / zff_vad
View on GitHub
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23Oct 19, 2023Updated 2 years ago
Lakemast / WifiCar
View on GitHub
Now you will be able to build and control your own RC Car over the Internet using the Message Queue Telemetry Transport Protocol (MQTT) w…
☆18Jan 21, 2023Updated 3 years ago
spirobel / bunny-llama
View on GitHub
iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh
☆51Oct 30, 2023Updated 2 years ago