biemster / gasrLinks
Google Chrome SODA Offline Speech Recognition command line client
☆159Updated 7 months ago
Alternatives and similar repositories for gasr
Users that are interested in gasr are comparing it to the libraries listed below
Sorting:
- Google Chrome Text to Speech command line client☆34Updated 4 years ago
- Android offline speech recognition natively on PC☆52Updated 4 years ago
- This project aims to research google's offline speech recognition, from several android apps and ideally make them interoperable by repli…☆68Updated 5 years ago
- Synchronize Whisper's timestamps over an existing accurate transcription☆156Updated last year
- openvino version of openai/whisper☆174Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆228Updated last month
- C++ library for converting text to phonemes for Piper☆134Updated 2 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆342Updated 10 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- An even smaller speech recognizer / force aligner☆36Updated 9 months ago
- 🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.☆172Updated 5 years ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆84Updated 2 years ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆516Updated last year
- Port of Meta's Encodec in C/C++☆226Updated 9 months ago
- ONNX Inference of Pyannote Segmentation☆93Updated 8 months ago
- Open models for Coqui STT☆142Updated 2 years ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆249Updated 2 years ago
- Read, write, convert and segment WebVTT caption files in Python.☆221Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.☆123Updated 9 months ago
- On-device noise suppression powered by deep learning☆74Updated last month
- An experiment of trying out whisper.cpp for real-time speech-to-text☆20Updated 2 years ago
- ☆40Updated last year
- ✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux☆64Updated this week
- Model for recasing and repunctuating ASR transcripts☆138Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated 2 years ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆47Updated 6 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆131Updated 10 months ago
- 📈 A forced aligner intended for synchronization of narrated text☆98Updated last month