AASHISHAG / deepspeech-german
Automatic Speech Recognition (ASR) - German
☆313Updated last year
Related projects ⓘ
Alternatives and complementary repositories for deepspeech-german
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆173Updated last year
- Automatic Speech Recognition (ASR) - German☆18Updated 4 years ago
- Scripts for training Mozilla's DeepSpeech using german speech data☆41Updated 4 years ago
- Open tools and data for cloudless automatic speech recognition☆443Updated 3 years ago
- ☆38Updated 2 months ago
- Python library for handling audio datasets.☆131Updated last year
- Tooling for producing French dataset for Common Voice☆100Updated last year
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework☆47Updated last year
- Examples of how to use or integrate DeepSpeech☆821Updated last year
- Tooling for producing Italian model (public release available) for DeepSpeech and text corpus☆93Updated 2 years ago
- 🐸STT integration examples☆121Updated 2 years ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.☆580Updated 3 years ago
- A testing server for a speech to text service based on coqui.ai☆215Updated 2 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆355Updated last year
- Crawling and creating a German language model resource☆19Updated 2 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆200Updated 3 months ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆54Updated 7 months ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆512Updated last year
- Voice Activity Detection (VAD) using deep learning.☆192Updated 5 years ago
- Command line tool to create corpora for Common Voice☆75Updated 5 months ago
- VOSK Speech Recognition Toolkit☆383Updated 2 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆37Updated 2 years ago
- DeepSpeech based forced alignment tool☆234Updated 3 years ago
- A small package for handy conversion of german numerals (also ordinal / signed) written as words to numbers.☆12Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆165Updated 4 months ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆200Updated 3 years ago
- Pytorch implementation of deep audio embedding calculation☆99Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice …☆500Updated last year
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆72Updated 3 years ago
- The code enables users to use Mozilla's Deep Speech model over the Web Browser.☆33Updated last year