AIFanatic / google-offline-speech-recognition
This project researches Google's offline speech recognition, as shipped in several Android apps, with the goal of making it interoperable by replicating it on any system that supports TensorFlow.
☆63 Updated 4 years ago
Alternatives and similar repositories for google-offline-speech-recognition:
Users who are interested in google-offline-speech-recognition are comparing it to the libraries listed below.
- Android offline speech recognition natively on PC ☆50 Updated 4 years ago
- Google Chrome SODA Offline Speech Recognition command line client ☆150 Updated this week
- Google Chrome Text to Speech command line client ☆32 Updated 3 years ago
- This repository is a collection of TTS Models in TFLite ☆189 Updated 3 years ago
- VCTK multi-speaker tacotron for ICASSP 2020 ☆265 Updated 2 years ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions. ☆65 Updated last year
- On-device noise suppression powered by deep learning ☆64 Updated this week
- Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech ☆98 Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆190 Updated this week
- Desktop application for neural speech synthesis written in C++ ☆212 Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework ☆59 Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆296 Updated 2 months ago
- wake word engine benchmark framework ☆131 Updated 3 years ago
- speaker diarization system using an LSTM ☆49 Updated 2 years ago
- DEPRECATED - An Android library module to Mozilla's Speech-To-Text services ☆205 Updated 4 years ago
- An example app that demos how to use TFLite to do automatic speech recognition on-device ☆13 Updated 3 years ago
- DeepSpeech based forced alignment tool ☆234 Updated 4 years ago
- ESPnet Model Zoo ☆245 Updated last year
- Music Pitch detection using Tensorflow SPICE model. ☆72 Updated 4 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ☆88 Updated 3 years ago
- PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech ☆228 Updated 2 years ago
- On-device speaker diarization powered by deep learning ☆33 Updated this week
- A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html ☆25 Updated last month
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface. ☆228 Updated 4 years ago
- ☆254 Updated 2 years ago
- Kaldi based speaker verification ☆47 Updated 6 years ago
- ☆58 Updated 3 years ago
- Zero-shot Audio Classification using Whisper ☆77 Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆325 Updated 8 months ago