biemster / gasrView external linksLinks
Google Chrome SODA Offline Speech Recognition command line client
☆164Jan 28, 2025Updated last year
Alternatives and similar repositories for gasr
Users that are interested in gasr are comparing it to the libraries listed below
Sorting:
- Google Chrome Text to Speech command line client☆34Jul 16, 2021Updated 4 years ago
- This project aims to research google's offline speech recognition, from several android apps and ideally make them interoperable by repli…☆68May 10, 2020Updated 5 years ago
- This is code for an audio search engine that uses vocal imitations of the desired sound☆38May 16, 2023Updated 2 years ago
- Experiment in automatic insertion of timed transcript corrections☆21Oct 31, 2017Updated 8 years ago
- Fully Local Push-to-Transcribe☆16Nov 6, 2025Updated 3 months ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated 10 months ago
- ☆11Aug 11, 2023Updated 2 years ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Sep 6, 2023Updated 2 years ago
- ☆13Oct 25, 2024Updated last year
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- IPA Phonetic dataset lexicon☆18Jan 12, 2026Updated last month
- ☆23Nov 3, 2025Updated 3 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated 11 months ago
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated 2 weeks ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- Set or toggle multiple monitor's input sources via DDC/CI☆13Mar 1, 2019Updated 6 years ago
- ☆14Aug 1, 2025Updated 6 months ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Feb 13, 2020Updated 6 years ago
- ☆15Nov 10, 2025Updated 3 months ago
- Solfège learning for Android☆11Nov 7, 2020Updated 5 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Colab notebooks for Next-gen Kaldi☆29Oct 12, 2025Updated 4 months ago
- Reverse Engineered implementation of the Backblaze Personal Backup Downloader client☆11Dec 22, 2024Updated last year
- A tiny cross-platform library to get app icons of other applications.☆15Feb 9, 2026Updated last week
- What if we wired an LLM to an HTTP server and connected it to a database and told it to be an API?☆12Jan 17, 2024Updated 2 years ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆35Feb 11, 2025Updated last year
- The Codec 2 speech codec, compiled to WASM using Emscripten.☆12Apr 27, 2023Updated 2 years ago
- ☆17Oct 22, 2020Updated 5 years ago
- Kaldi code for doing DNN with tensorflow☆13Feb 8, 2016Updated 10 years ago
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆54Sep 25, 2023Updated 2 years ago
- High quality text-to-speech based on StyleTTS 2.☆72Updated this week
- ☆27Jan 19, 2021Updated 5 years ago
- Tensorflow-based wake word detection☆17Jan 29, 2026Updated 2 weeks ago
- ☆14Aug 19, 2024Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 8 months ago
- Learnable STRF, from Riad et al. 2021 JASA☆13Aug 21, 2021Updated 4 years ago
- Chrome Extension for Flow Chat Messages on YouTube Live (Mirror).☆17Feb 21, 2023Updated 2 years ago