msqr1 / Vosklet
A speech recognizer that can run on the browser, inspired by vosk-browser
☆33Updated last month
Related projects ⓘ
Alternatives and complementary repositories for Vosklet
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆382Updated 10 months ago
- An even smaller speech recognizer / force aligner☆32Updated last week
- ONNX Inference of Pyannote Segmentation☆66Updated 2 months ago
- ☆29Updated 7 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆18Updated 9 months ago
- On-device noise suppression powered by deep learning☆63Updated last month
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Updated 4 years ago
- ez audio transcription tool with flexible processing and post-processing options☆130Updated 9 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆105Updated last year
- Buildings block for voice-enabled applications in the browser☆33Updated last week
- A library for real-time voice processing in web browsers☆200Updated last month
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated 8 months ago
- Google Chrome Text to Speech command line client☆30Updated 3 years ago
- On-device speaker diarization powered by deep learning☆25Updated this week
- Timething is a library for aligning text transcripts with their audio recordings.☆103Updated last year
- ☆16Updated 3 years ago
- An experiment of trying out whisper.cpp for real-time speech-to-text☆20Updated last year
- This UI serves as a Synthetic ASR Dataset Generator powered by/for OpenAI Whisper, enabling users to capture audio, transcribing it, on t…☆24Updated 2 weeks ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆44Updated last year
- Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.☆44Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆84Updated 6 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆32Updated last week
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated last year
- ☆98Updated 4 months ago
- On-device voice activity detection (VAD) powered by deep learning☆179Updated this week
- A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)☆16Updated 6 years ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆65Updated 2 years ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way☆39Updated last year
- Model for recasing and repunctuating ASR transcripts☆129Updated 7 months ago
- Colab notebooks for Next-gen Kaldi☆26Updated 3 weeks ago