biemster / gasr
Google Chrome SODA Offline Speech Recognition command line client
☆160 · Updated 9 months ago
Alternatives and similar repositories for gasr
Users who are interested in gasr are comparing it to the repositories listed below.
- Google Chrome Text to Speech command line client ☆34 · Updated 4 years ago
- This project aims to research Google's offline speech recognition, from several Android apps and ideally make them interoperable by repli… ☆68 · Updated 5 years ago
- Android offline speech recognition natively on PC ☆52 · Updated 4 years ago
- C++ library for converting text to phonemes for Piper ☆134 · Updated 3 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription ☆159 · Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆232 · Updated last month
- Experiments to test different speech recognition systems for SEPIA Framework ☆63 · Updated 2 years ago
- Accelerating faster-whisper single file processing by multiprocessing through parallelization ☆55 · Updated 2 years ago
- OpenVINO version of openai/whisper ☆176 · Updated last year
- An even smaller speech recognizer / force aligner ☆36 · Updated 10 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆346 · Updated 11 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision ☆86 · Updated 2 years ago
- Web-based editor for subtitles and transcripts ☆141 · Updated last year
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc… ☆31 · Updated last year
- Real-time Whisper voice recognition with Vosk model feedback ☆119 · Updated 2 years ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions ☆64 · Updated 2 years ago
- On-device noise suppression powered by deep learning ☆76 · Updated 2 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection ☆132 · Updated 11 months ago
- Timething is a library for aligning text transcripts with their audio recordings ☆124 · Updated 11 months ago
- Model for recasing and repunctuating ASR transcripts ☆141 · Updated last year
- ONNX Inference of Pyannote Segmentation ☆95 · Updated 10 months ago
- Voice models for Mimic 3 text to speech system ☆154 · Updated last year
- ☆260 · Updated 2 years ago
- 🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor. ☆171 · Updated 6 years ago
- Read, write, convert and segment WebVTT caption files in Python ☆222 · Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆34 · Updated 5 years ago
- Port of Meta's Encodec in C/C++ ☆223 · Updated 10 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆119 · Updated 2 years ago
- Package for aligning audio files through audio fingerprinting ☆130 · Updated 7 months ago
- 📈 A forced aligner intended for synchronization of narrated text ☆100 · Updated 2 months ago