AASHISHAG / DeepSpeech-API
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
β32Updated 2 years ago
Alternatives and similar repositories for DeepSpeech-API:
Users that are interested in DeepSpeech-API are comparing it to the libraries listed below
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 4 years ago
- πΈSTT integration examplesβ124Updated 2 years ago
- Web app for keyword spotting using TensorflowJSβ69Updated 2 years ago
- Gecko - A Tool for Effective Annotation of Human Conversationsβ280Updated last year
- DeepSpeech based forced alignment toolβ237Updated 4 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β202Updated 6 months ago
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- πΈTTS recipes for different datasetsβ85Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.β77Updated last year
- Scripts to simplify data prepping for Mozilla DeepSpeech.β14Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corporaβ81Updated 9 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection.β123Updated 3 months ago
- Command line tool to create corpora for Common Voiceβ75Updated 8 months ago
- Model for recasing and repunctuating ASR transcriptsβ133Updated 10 months ago
- How to create your own model for voskβ69Updated 3 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.β173Updated last year
- A python package for deep multilingual punctuation prediction.β116Updated 5 months ago
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ153Updated 5 years ago
- voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by messaging protocol over MQTTβ93Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ145Updated 9 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ25Updated last year
- Open Source AI Benchmarking toolkit for benchmarking speech to text servicesβ55Updated 9 months ago
- Interface for using TTS and vocoder models in the form of a text editorβ19Updated 2 years ago
- Tools to create your own voice dataset for TTS trainingβ66Updated 4 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conneβ¦β216Updated 4 years ago
- Desktop application for neural speech synthesis written in C++β213Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β101Updated last year
- Buildings block for voice-enabled applications in the browserβ34Updated last week
- Finetune VITS and MMS using HuggingFace's toolsβ132Updated 10 months ago
- On-device voice activity detection (VAD) powered by deep learningβ197Updated this week