AIFanatic / google-offline-speech-recognition
This project aims to research Google's offline speech recognition, as used in several Android apps, and ideally make it interoperable by replicating it on any system that supports TensorFlow.
☆67Updated 5 years ago
Alternatives and similar repositories for google-offline-speech-recognition
Users who are interested in google-offline-speech-recognition are comparing it to the repositories listed below.
- Android offline speech recognition natively on PC☆52Updated 4 years ago
- Google Chrome SODA Offline Speech Recognition command line client☆158Updated 4 months ago
- Google Chrome Text to Speech command line client☆34Updated 3 years ago
- This repository is a collection of TTS Models in TFLite☆194Updated 4 years ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated last year
- A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html☆27Updated 3 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆81Updated last year
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech☆102Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning☆218Updated last week
- Open tools and data for cloudless automatic speech recognition☆10Updated 5 years ago
- ONNX wrapper for ESPnet inference model☆163Updated 8 months ago
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- Model for recasing and repunctuating ASR transcripts☆133Updated last year
- OpenVINO version of openai/whisper☆167Updated last year
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- Postprocess SRT derived speech alignments for creating clean datasets for machine learning☆17Updated 2 years ago
- 🐸STT integration examples☆129Updated 2 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆89Updated 4 years ago
- 📈 A forced aligner intended for synchronization of narrated text☆93Updated 2 years ago
- Speaker diarization system using an LSTM☆50Updated 2 years ago
- On-device noise suppression powered by deep learning☆73Updated this week
- A tokenizer, text cleaner, and phonemizer for many human languages.☆317Updated 7 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- C++ library for converting text to phonemes for Piper☆122Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆115Updated 2 years ago
- PyTorch implementation of DeepMind's WaveRNN model☆121Updated 5 years ago
- Automatically align transcribed audio and generate a wav2letter training corpus☆36Updated 2 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 2 years ago