AASHISHAG / DeepSpeech-APILinks
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
β32Updated 3 years ago
Alternatives and similar repositories for DeepSpeech-API
Users that are interested in DeepSpeech-API are comparing it to the libraries listed below
Sorting:
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ103Updated 5 years ago
- πΈSTT integration examplesβ130Updated 3 years ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β509Updated 2 years ago
- Web app for keyword spotting using TensorflowJSβ74Updated 3 years ago
- A testing server for a speech to text service based on coqui.aiβ219Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β332Updated last year
- A library for real-time voice processing in web browsersβ238Updated 2 weeks ago
- SEPIA server to support open-source speech recognition via WebSocket connection.β135Updated last year
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β215Updated last year
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license sβ¦β694Updated this week
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ260Updated 2 months ago
- Automatic Speech Recognition (ASR) - Germanβ319Updated 2 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, reβ¦β46Updated 2 years ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.β586Updated 4 years ago
- Performant and accurate speech recognition built on Pytorchβ254Updated 3 years ago
- DeepSpeech based forced alignment toolβ239Updated 5 years ago
- voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by messaging protocol over MQTTβ93Updated 2 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β844Updated 2 years ago
- β© Generating speech in a single forward pass without any attention!β581Updated 2 weeks ago
- Personal wake word detectorβ69Updated 2 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style trβ¦β900Updated 2 years ago
- openvino version of openai/whisperβ182Updated 2 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.β376Updated 2 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conneβ¦β217Updated 5 years ago
- On-device speech-to-text engine powered by deep learningβ471Updated this week
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretationβ569Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learningβ242Updated 2 weeks ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )β293Updated 4 years ago
- Voice models for Mimic 3 text to speech systemβ162Updated last year