AIFanatic / google-offline-speech-recognition
This project aims to research google's offline speech recognition, from several android apps and ideally make them interoperable by replicating it on any system that supports tensorflow.
☆65Updated 4 years ago
Alternatives and similar repositories for google-offline-speech-recognition:
Users that are interested in google-offline-speech-recognition are comparing it to the libraries listed below
- Android offline speech recognition natively on PC☆52Updated 4 years ago
- Google Chrome SODA Offline Speech Recognition command line client☆157Updated 2 months ago
- Google Chrome Text to Speech command line client☆33Updated 3 years ago
- This repository is a collection of TTS Models in TFLite☆192Updated 4 years ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated last year
- A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html☆26Updated 3 weeks ago
- Experiments to test different speech recognition systems for SEPIA Framework☆59Updated last year
- Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech☆98Updated 3 years ago
- Zero-shot Audio Classification using Whisper☆80Updated 2 years ago
- An even smaller speech recognizer / force aligner☆32Updated 3 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆77Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆203Updated this week
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Updated 4 years ago
- How to create your own model for vosk☆70Updated 3 years ago
- 🐸STT integration examples☆127Updated 2 years ago
- Model for recasing and repunctuating ASR transcripts☆133Updated 11 months ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆29Updated 8 months ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- 26-Point MFCC & 512-Point FFT Generator & Visualizer in Java, C++, and NEON intrinsics☆15Updated 5 years ago
- An example app that demos how to use TFLite to do automatic speech recognition on-device☆14Updated 3 years ago
- An Android app that offers speech-to-text user interfaces to other apps☆282Updated 4 months ago
- webrtcvad provides node.js bindings to the WebRTC voice activity detection library.☆30Updated 4 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆146Updated 10 months ago
- STT Service based on Kaldi ASR☆15Updated 6 years ago
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- Web app for keyword spotting using TensorflowJS☆71Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 5 months ago
- openvino version of openai/whisper☆166Updated last year