biemster / gasr
Google Chrome SODA Offline Speech Recognition command line client
☆154Updated 3 weeks ago
Alternatives and similar repositories for gasr:
Users that are interested in gasr are comparing it to the libraries listed below
- Google Chrome Text to Speech command line client☆32Updated 3 years ago
- This project aims to research google's offline speech recognition, from several android apps and ideally make them interoperable by repli…☆64Updated 4 years ago
- Android offline speech recognition natively on PC☆50Updated 4 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆58Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆306Updated 3 months ago
- ez audio transcription tool with flexible processing and post-processing options☆144Updated last year
- openvino version of openai/whisper☆165Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆198Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervision☆74Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆141Updated 8 months ago
- ☆245Updated last year
- rnnoise noise suppression library as a WASM module☆132Updated 2 weeks ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆65Updated last year
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆515Updated 10 months ago
- Model for recasing and repunctuating ASR transcripts☆133Updated 10 months ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Tran…☆460Updated this week
- Wasm Port of Recurrent neural network for audio noise reduction. Based on xiph/rnnoise C++ project☆41Updated 4 years ago
- 🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.☆170Updated 5 years ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆46Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆80Updated last month
- SEPIA server to support open-source speech recognition via WebSocket connection.☆123Updated 3 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆108Updated 2 years ago
- Voice models for Mimic 3 text to speech system☆139Updated 8 months ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆407Updated last year
- C++ library for converting text to phonemes for Piper☆106Updated 11 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year
- 🐸STT integration examples☆125Updated 2 years ago
- An even smaller speech recognizer / force aligner☆32Updated 2 months ago
- 📈 A forced aligner intended for synchronization of narrated text☆89Updated 2 years ago