AASHISHAG / DeepSpeech-API
This code lets users run Mozilla's DeepSpeech model from a web browser.
★31 · Updated 2 years ago
Alternatives and similar repositories for DeepSpeech-API
Users who are interested in DeepSpeech-API are comparing it to the libraries listed below.
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ★102 · Updated 5 years ago
- 🐸STT integration examples ★129 · Updated 2 years ago
- Web app for keyword spotting using TensorFlow.js ★72 · Updated 2 years ago
- Automatic Speech Recognition (ASR) - German ★314 · Updated 2 years ago
- Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ★254 · Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages. ★320 · Updated 8 months ago
- 🐸TTS recipes for different datasets ★86 · Updated 2 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi. ★173 · Updated last year
- Thorsten-Voice: A free-to-use, offline-working, high-quality German TTS voice should be available for every project without any license s… ★627 · Updated 6 months ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice … ★511 · Updated 2 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ★210 · Updated 11 months ago
- Performant and accurate speech recognition built on PyTorch ★253 · Updated 3 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning. ★838 · Updated last year
- Gecko - A Tool for Effective Annotation of Human Conversations ★292 · Updated 2 years ago
- Live speech recognition using Facebook's wav2vec 2.0 model. ★361 · Updated last year
- Web Browser Audio Detection/Speech Recording Events API ★75 · Updated 3 years ago
- ⏩ Generating speech in a single forward pass without any attention! ★579 · Updated 11 months ago
- Building blocks for voice-enabled applications in the browser ★37 · Updated 3 months ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style tr… ★898 · Updated 2 years ago
- Desktop application for neural speech synthesis written in C++ ★215 · Updated 2 years ago
- Grapheme to phoneme conversion with deep learning. ★389 · Updated last year
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ★544 · Updated 2 years ago
- A testing server for a speech to text service based on coqui.ai ★215 · Updated 3 years ago
- Voice models for Mimic 3 text to speech system ★150 · Updated last year
- A tool for automatic phoneme transcription ★157 · Updated 2 years ago
- A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ★1,343 · Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning. ★362 · Updated 2 years ago
- DeepSpeech based forced alignment tool ★238 · Updated 4 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection. ★128 · Updated 8 months ago
- On-device voice activity detection (VAD) powered by deep learning ★220 · Updated last week