Google Chrome SODA Offline Speech Recognition command line client
☆170Jan 28, 2025Updated last year
Alternatives and similar repositories for gasr
Users that are interested in gasr are comparing it to the libraries listed below.
- Google Chrome Text to Speech command line client☆37Jul 16, 2021Updated 4 years ago
- Android offline speech recognition natively on PC☆53Dec 13, 2020Updated 5 years ago
- Kaldi code for doing DNN with tensorflow☆13Feb 8, 2016Updated 10 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- This is code for an audio search engine that uses vocal imitations of the desired sound☆38May 16, 2023Updated 3 years ago
- Tensorflow-based wake word detection☆18Jun 22, 2026Updated last week
- The Codec 2 speech codec, compiled to WASM using Emscripten.☆13Apr 27, 2023Updated 3 years ago
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain only☆33May 28, 2026Updated last month
- A Kotlin Multiplatform Project utilizing ggwave, a data-over-sound library.☆20Nov 23, 2024Updated last year
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Sep 6, 2023Updated 2 years ago
- ☆11Aug 11, 2023Updated 2 years ago
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Jan 7, 2026Updated 5 months ago
- Accelerate Whisper tasks such as transcription, by multiprocessing through parallelization☆25Oct 29, 2022Updated 3 years ago
- ✨Realtime Voice Changer with 3~ seconds for custom voice in CPU☆20Apr 21, 2026Updated 2 months ago
- ☆14Aug 19, 2024Updated last year
- ☆27Jan 19, 2021Updated 5 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- ☆14Aug 1, 2025Updated 10 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆23Jun 7, 2025Updated last year
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆55Sep 25, 2023Updated 2 years ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆39Feb 11, 2025Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆57Feb 19, 2024Updated 2 years ago
- Colab notebooks for Next-gen Kaldi☆31Oct 12, 2025Updated 8 months ago
- Raspberry-based E-Paper Smart Home Display Project☆21Apr 13, 2026Updated 2 months ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆57Apr 9, 2026Updated 2 months ago
- ☆27Nov 3, 2025Updated 7 months ago
- Compiled list of links from "Ask HN: Where can I post my startup to get beta users?"☆17Jan 28, 2016Updated 10 years ago
- High quality text-to-speech based on StyleTTS 2.☆78Apr 6, 2026Updated 2 months ago
- Web App to transcribe memos using Whisper AI.☆18Oct 23, 2022Updated 3 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆16Mar 15, 2025Updated last year
- llama.cpp gguf file parser for javascript☆50Dec 11, 2024Updated last year
- openvino version of openai/whisper☆184Nov 6, 2023Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆93Aug 28, 2023Updated 2 years ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year
- repo of files pertaining to realtime, offline translations using whisper realtime and argos translate. This repo is marked Creative Commo…☆19May 20, 2025Updated last year
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated last year
- Fast neural codec compression and generation for audio waveforms☆230Dec 4, 2024Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated last year
- Sequence to sequence model for Arabic punctuation prediction.☆12Feb 13, 2020Updated 6 years ago