AASHISHAG / DeepSpeech-APILinks
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
β31Updated 2 years ago
Alternatives and similar repositories for DeepSpeech-API
Users that are interested in DeepSpeech-API are comparing it to the libraries listed below
Sorting:
- πΈSTT integration examplesβ129Updated 2 years ago
- πΈTTS recipes for different datasetsβ87Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 5 years ago
- Web app for keyword spotting using TensorflowJSβ72Updated 2 years ago
- Command line tool to create corpora for Common Voiceβ77Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts.β80Updated 2 years ago
- Linguistic processing for Common Voiceβ55Updated last year
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voiceβ10Updated 4 years ago
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β208Updated 11 months ago
- A neural attention model for speech command recognitionβ185Updated 2 years ago
- Buildings block for voice-enabled applications in the browserβ37Updated 2 months ago
- An automatic speech recognition APIβ61Updated this week
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.β173Updated last year
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ154Updated 5 years ago
- Automatic Speech Recognition (ASR) - Germanβ18Updated 4 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Frameworkβ47Updated 2 years ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ254Updated last year
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.β14Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corporaβ81Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β102Updated 2 years ago
- Tooling for producing French dataset for Common Voiceβ101Updated 5 months ago
- Converts spoken words into text form.β75Updated last month
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Python library for handling audio datasets.β138Updated last year
- This project is about performing Speaker diarization for Hindi Language.β50Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecodeβ111Updated 2 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banksβ168Updated last year
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ26Updated 2 years ago