AASHISHAG / DeepSpeech-APILinks
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
β31Updated 2 years ago
Alternatives and similar repositories for DeepSpeech-API
Users that are interested in DeepSpeech-API are comparing it to the libraries listed below
Sorting:
- πΈSTT integration examplesβ130Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 5 years ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β513Updated 2 years ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- Web app for keyword spotting using TensorflowJSβ73Updated 2 years ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ255Updated last year
- A testing server for a speech to text service based on coqui.aiβ216Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β321Updated 8 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β210Updated last year
- Buildings block for voice-enabled applications in the browserβ37Updated 3 months ago
- Automatic Speech Recognition (ASR) - Germanβ315Updated 2 years ago
- Grapheme to phoneme conversion with deep learning.β393Updated last year
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β838Updated last year
- DeepSpeech based forced alignment toolβ238Updated 4 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.β363Updated last year
- β22Updated 2 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.β173Updated 2 years ago
- voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by messaging protocol over MQTTβ94Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.β80Updated 2 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conneβ¦β217Updated 5 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banksβ170Updated last year
- On-device voice activity detection (VAD) powered by deep learningβ223Updated last week
- Gecko - A Tool for Effective Annotation of Human Conversationsβ293Updated 2 years ago
- Open tools and data for cloudless automatic speech recognitionβ446Updated 4 years ago
- openvino version of openai/whisperβ170Updated last year
- β37Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ149Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.β128Updated 9 months ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.β584Updated 4 years ago
- A minimalist hotword / wake word for the web, based on Porcupineβ60Updated 3 weeks ago