Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆56 · Jul 2, 2021 · Updated 4 years ago
Alternatives and similar repositories for voskJs
Users that are interested in voskJs are comparing it to the libraries listed below.
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server. ☆30 · Jun 8, 2021 · Updated 4 years ago
- voice interface prototyping application ☆11 · Mar 2, 2023 · Updated 3 years ago
- wake word spotting with kaldi ☆19 · Dec 3, 2020 · Updated 5 years ago
- Java Bindings for the C++ library DeepSpeech ☆10 · Jun 4, 2020 · Updated 5 years ago
- Perform the forced decoding with target transcription ☆11 · Sep 12, 2018 · Updated 7 years ago
- A pipeline to isolate and transcribe one language in mixed-language speech ☆20 · Oct 25, 2022 · Updated 3 years ago
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk ☆507 · Dec 7, 2025 · Updated 4 months ago
- Text-Dependent Speaker Recognition System with Machine Learning Techniques ☆10 · Dec 31, 2017 · Updated 8 years ago
- python wrap for hts engine ☆14 · Jan 30, 2018 · Updated 8 years ago
- brainless concatenative text to speech ☆14 · May 11, 2021 · Updated 4 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou… ☆63 · May 13, 2020 · Updated 5 years ago
- ☆13 · Oct 27, 2021 · Updated 4 years ago
- Free noise reduction of speech signals ☆12 · Jul 26, 2016 · Updated 9 years ago
- NaifJs, a simple state-machine based dialog manager. ☆25 · May 9, 2021 · Updated 4 years ago
- A SPMI Lab toolkit for language models. ☆11 · Apr 12, 2017 · Updated 9 years ago
- A Simple Flask App to interact with your Machine Translation Model ☆13 · Feb 26, 2020 · Updated 6 years ago
- NMT based punctuation prediction system using lexical and acoustic features. ☆14 · Mar 30, 2020 · Updated 6 years ago
- Wenet speech to text for react native ☆10 · Nov 1, 2022 · Updated 3 years ago
- *NIX SHELL with Local AI/LLM integration ☆26 · Feb 26, 2025 · Updated last year
- Unsupervised speech activity detection system. ☆11 · Jul 2, 2018 · Updated 7 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone … ☆43 · Aug 3, 2022 · Updated 3 years ago
- Scripts to create speech corpora from open.bible ☆13 · Jan 3, 2022 · Updated 4 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.… ☆11 · Feb 4, 2020 · Updated 6 years ago
- Performs inverse normalization on normalized Chinese text; specifically, it implements the inverse of chinese_text_normalization. ☆13 · Apr 7, 2021 · Updated 5 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm. ☆16 · May 18, 2023 · Updated 2 years ago
- steps to perform text-based speaker diarization with kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- Text frontend for ESPnet tts recipes ☆34 · Jun 1, 2021 · Updated 4 years ago
- Kaldi recipe to train commonvoice corpus in Thai language ☆49 · Aug 12, 2022 · Updated 3 years ago
- Open source cross-platform implementation of MRCP protocol ☆20 · Mar 3, 2022 · Updated 4 years ago
- RunCloud Let's Encrypt Automation on Free/Paid Plan ☆11 · Jan 24, 2021 · Updated 5 years ago
- The Kotlin wrapper of llama.cpp, powered by JNA ☆14 · Aug 8, 2023 · Updated 2 years ago
- Detect and remove or lower the volume of breathing in speech recordings. ☆14 · May 14, 2025 · Updated 10 months ago
- A C++ library for parsing and manipulating JSGF grammar files. ☆14 · Feb 13, 2024 · Updated 2 years ago
- Speaker Diarization library in Python. Performs VAD, Segmentation, Linear Clustering, Hierarchical Clustering ☆15 · Jul 28, 2017 · Updated 8 years ago
- ☆14 · Jun 12, 2015 · Updated 10 years ago
- ☆16 · Jun 13, 2022 · Updated 3 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP) ☆11 · Dec 4, 2023 · Updated 2 years ago
- ☆10 · May 7, 2020 · Updated 5 years ago
- Multi-package repo for Jargon's nodejs SDKs ☆13 · May 9, 2021 · Updated 4 years ago