Google Chrome SODA Offline Speech Recognition command line client
☆164Jan 28, 2025Updated last year
Alternatives and similar repositories for gasr
Users that are interested in gasr are comparing it to the libraries listed below
Sorting:
- This is code for an audio search engine that uses vocal imitations of the desired sound☆38May 16, 2023Updated 2 years ago
- Experiment in automatic insertion of timed transcript corrections☆21Oct 31, 2017Updated 8 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated 10 months ago
- Fully Local Push-to-Transcribe☆18Nov 6, 2025Updated 4 months ago
- ☆11Aug 11, 2023Updated 2 years ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Sep 6, 2023Updated 2 years ago
- ☆26Nov 3, 2025Updated 4 months ago
- IPA Phonetic dataset lexicon☆18Updated this week
- ☆13Oct 25, 2024Updated last year
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated 11 months ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated last month
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- ☆15Nov 10, 2025Updated 3 months ago
- Set or toggle multiple monitor's input sources via DDC/CI☆13Mar 1, 2019Updated 7 years ago
- The Codec 2 speech codec, compiled to WASM using Emscripten.☆13Apr 27, 2023Updated 2 years ago
- PE/MZ Header Parser :: A crossplatform Windows PE/MS-DOS MZ Header Parser : Powered by @pay1oad-repo☆11Jul 4, 2025Updated 8 months ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Feb 13, 2020Updated 6 years ago
- ☆14Aug 1, 2025Updated 7 months ago
- Solfège learning for Android☆11Nov 7, 2020Updated 5 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- ☆17Oct 22, 2020Updated 5 years ago
- Colab notebooks for Next-gen Kaldi☆31Oct 12, 2025Updated 4 months ago
- A tiny cross-platform library to get app icons of other applications.☆15Feb 16, 2026Updated 3 weeks ago
- Reverse Engineered implementation of the Backblaze Personal Backup Downloader client☆11Dec 22, 2024Updated last year
- Kaldi code for doing DNN with tensorflow☆13Feb 8, 2016Updated 10 years ago
- What if we wired an LLM to an HTTP server and connected it to a database and told it to be an API?☆12Jan 17, 2024Updated 2 years ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆37Feb 11, 2025Updated last year
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆54Sep 25, 2023Updated 2 years ago
- High quality text-to-speech based on StyleTTS 2.☆73Feb 25, 2026Updated last week
- ☆27Jan 19, 2021Updated 5 years ago
- Learnable STRF, from Riad et al. 2021 JASA☆13Aug 21, 2021Updated 4 years ago
- Tensorflow-based wake word detection☆17Jan 29, 2026Updated last month
- Chrome Extension for Flow Chat Messages on YouTube Live (Mirror).☆17Feb 21, 2023Updated 3 years ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Jun 27, 2023Updated 2 years ago
- A hardware monitor built on top of Libre Hardware Monitor☆14Aug 17, 2023Updated 2 years ago
- Fork of dump1090-stream-parser. Takes SBS output from `dump1090` and puts it into a database.☆13Apr 16, 2019Updated 6 years ago