AASHISHAG / DeepSpeech-API
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
β33Updated last year
Related projects: β
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ101Updated 4 years ago
- πΈTTS recipes for different datasetsβ84Updated 2 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.β172Updated last year
- DeepSpeech based forced alignment toolβ232Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β272Updated 2 months ago
- πΈSTT integration examplesβ118Updated last year
- How to create your own model for voskβ63Updated 3 years ago
- Automatic Speech Recognition (ASR) - Germanβ18Updated 4 years ago
- Pytorch implementation of Deepmind's WaveRNN modelβ120Updated 5 years ago
- Web app for keyword spotting using TensorflowJSβ69Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.β120Updated last year
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β194Updated last month
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ132Updated 4 months ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Frameworkβ47Updated last year
- [deprecated] Pretrained models for pyannote-audio 1.xβ70Updated 2 years ago
- A tool for automatic phoneme transcriptionβ155Updated last year
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretationβ508Updated last year
- Open Source AI Benchmarking toolkit for benchmarking speech to text servicesβ54Updated 5 months ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ234Updated last year
- Scripts for training Mozilla's DeepSpeech using german speech dataβ41Updated 4 years ago
- A testing server for a speech to text service based on coqui.aiβ214Updated 2 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ151Updated 4 years ago
- Model for recasing and repunctuating ASR transcriptsβ126Updated 5 months ago
- VCTK multi-speaker tacotron for ICASSP 2020β265Updated 2 years ago
- An automatic speech recognition APIβ40Updated 2 weeks ago
- Gecko - A Tool for Effective Annotation of Human Conversationsβ274Updated last year
- π Coqui's machine learning job schedulerβ31Updated 3 years ago
- Grapheme to phoneme conversion with deep learning.β349Updated 9 months ago
- A model that predicts the punctuation of English, Italian, French and German texts.β70Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banksβ152Updated 2 months ago