biemster / gasr
Google Chrome SODA Offline Speech Recognition command line client
☆157 · Updated 2 months ago
Alternatives and similar repositories for gasr:
Users interested in gasr are comparing it to the libraries listed below:
- Google Chrome Text to Speech command line client ☆33 · Updated 3 years ago
- This project aims to research Google's offline speech recognition from several Android apps and ideally make them interoperable by repli… ☆65 · Updated 4 years ago
- Android offline speech recognition natively on PC ☆52 · Updated 4 years ago
- Experiments to test different speech recognition systems for SEPIA Framework ☆59 · Updated last year
- ez audio transcription tool with flexible processing and post-processing options ☆147 · Updated last year
- An even smaller speech recognizer / force aligner ☆32 · Updated 3 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆316 · Updated 4 months ago
- OpenVINO version of openai/whisper ☆166 · Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆203 · Updated this week
- Real-Time Whisper Voice Recognition with Vosk model feedback. ☆112 · Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision ☆77 · Updated last year
- web based editor for subtitles and transcripts ☆127 · Updated 7 months ago
- Self-hosted, high-quality voice recognition for de-googled Android using Whisper. Like Siri or OK Google. ☆62 · Updated last year
- An echo cancellation library for browsers using DTLN-aec ☆26 · Updated last year
- ☆247 · Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆111 · Updated 2 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc… ☆29 · Updated 8 months ago
- Model for recasing and repunctuating ASR transcripts ☆133 · Updated 11 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription ☆142 · Updated 10 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens ☆482 · Updated last year
- On-device speech-to-text engine powered by deep learning ☆451 · Updated this week
- Improving transcription performance of OpenAI Whisper for CPU based deployment ☆239 · Updated 2 years ago
- Accelerating faster-whisper single file processing by multiprocessing through parallelization ☆53 · Updated last year
- An unofficial PyTorch implementation of StreamVC (Real-Time Low-Latency Voice Conversion) ☆119 · Updated 8 months ago
- ONNX Inference of Pyannote Segmentation ☆81 · Updated 3 months ago
- Open models for Coqui STT ☆135 · Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆47 · Updated 2 years ago
- On-device noise suppression powered by deep learning ☆69 · Updated 2 weeks ago
- Offline voice input panel & keyboard with punctuation for Android. ☆102 · Updated 10 months ago
- C++ version of the pyannote audio speaker diarization pipeline ☆20 · Updated last year