biemster / gasrLinks
Google Chrome SODA Offline Speech Recognition command line client
☆158Updated 5 months ago
Alternatives and similar repositories for gasr
Users that are interested in gasr are comparing it to the libraries listed below
Sorting:
- Google Chrome Text to Speech command line client☆34Updated 3 years ago
- This project aims to research google's offline speech recognition, from several android apps and ideally make them interoperable by repli…☆67Updated 5 years ago
- Android offline speech recognition natively on PC☆52Updated 4 years ago
- ez audio transcription tool with flexible processing and post-processing options☆152Updated last year
- openvino version of openai/whisper☆167Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆91Updated 6 months ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆30Updated 11 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription☆152Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆82Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆115Updated last year
- An echo cancellation library for browsers using DTLN-aec☆26Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆219Updated this week
- Wasm Port of Recurrent neural network for audio noise reduction. Based on xiph/rnnoise C++ project☆42Updated 4 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆329Updated 7 months ago
- ☆256Updated 2 years ago
- Read, write, convert and segment WebVTT caption files in Python.☆215Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions☆52Updated 2 years ago
- Accelerating faster-whisper single file processing by multiprocessing through parallelization☆54Updated 2 years ago
- ☆38Updated last year
- An even smaller speech recognizer / force aligner☆33Updated 6 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆115Updated 2 years ago
- web based editor for subtitles and transcripts☆135Updated 10 months ago
- ☆53Updated 2 years ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Coqui Inference Engine☆40Updated 3 years ago
- ☆13Updated 2 months ago
- An experiment of trying out whisper.cpp for real-time speech-to-text☆20Updated 2 years ago
- A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)☆16Updated 7 years ago
- 📈 A forced aligner intended for synchronization of narrated text☆93Updated 2 years ago