biemster / gasr

Google Chrome SODA Offline Speech Recognition command line client

☆158

Alternatives and similar repositories for gasr

Users who are interested in gasr are comparing it to the libraries listed below.

biemster / gtts
Google Chrome Text to Speech command line client
☆34 Updated 3 years ago
AIFanatic / google-offline-speech-recognition
This project aims to research Google's offline speech recognition, from several Android apps and ideally make them interoperable by repli…
☆67 Updated 5 years ago
biemster / asr
Android offline speech recognition natively on PC
☆52 Updated 4 years ago
geekodour / wscribe
ez audio transcription tool with flexible processing and post-processing options
☆152 Updated last year
zhuzilin / whisper-openvino
OpenVINO version of openai/whisper
☆167 Updated last year
fquirin / speech-recognition-experiments
Experiments to test different speech recognition systems for SEPIA Framework
☆60 Updated 2 years ago
pengzhendong / pyannote-onnx
ONNX Inference of Pyannote Segmentation
☆91 Updated 6 months ago
OzymandiasTheGreat / libfvad-wasm
Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…
☆30 Updated 11 months ago
EtienneAb3d / WhisperTimeSync
Synchronize Whisper's timestamps over an existing accurate transcription
☆152 Updated last year
moonshine-ai / openai-whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆82 Updated last year
appvoid / vosper
Real-time Whisper voice recognition with Vosk model feedback.
☆115 Updated last year
shiguredo / dtln-aec
An echo cancellation library for browsers using DTLN-aec
☆26 Updated last year
Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆219 Updated this week
intel-str / rnnoise-wasm
Wasm port of a recurrent neural network for audio noise reduction. Based on the xiph/rnnoise project.
☆42 Updated 4 years ago
EtienneAb3d / WhisperHallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
☆329 Updated 7 months ago
fortypercnt / stream-translator
☆256 Updated 2 years ago
glut23 / webvtt-py
Read, write, convert and segment WebVTT caption files in Python.
☆215 Updated last year
ANonEntity / WhisperWithVAD
Whisper combined with Silero VAD, for improved long-form transcriptions
☆52 Updated 2 years ago
RomanKlimov / faster-whisper-acceleration
Accelerating faster-whisper single-file processing through multiprocessing
☆54 Updated 2 years ago
rudder-analytics / Goodness-of-Pronounciation
☆38 Updated last year
ReadAlongs / SoundSwallower
An even smaller speech recognizer / forced aligner
☆33 Updated 6 months ago
jumon / whisper-punctuator
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆115 Updated 2 years ago
geekodour / wscribe-editor
Web-based editor for subtitles and transcripts
☆135 Updated 10 months ago
alumae / kiirkirjutaja
☆53 Updated 2 years ago
leohuang2013 / pyannote-audio_speaker-diarization_cpp
C++ version of pyannote audio speaker diarization pipeline
☆21 Updated last year
coqui-ai / inference-engine
Coqui Inference Engine
☆40 Updated 3 years ago
adrianlyjak / kokoro-onnx-export
☆13 Updated 2 months ago
tableos / mina
An experiment in using whisper.cpp for real-time speech-to-text
☆20 Updated 2 years ago
chorusai / arpa2ipa
A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)
☆16 Updated 7 years ago
r4victor / afaligner
📈 A forced aligner intended for synchronization of narrated text
☆93 Updated 2 years ago