mozilla-services / deepspeech-server
☆13Updated 3 years ago
Alternatives and similar repositories for deepspeech-server:
Users that are interested in deepspeech-server are comparing it to the libraries listed below
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- Official home of the Idlak Speech Synthesis Toolkit☆66Updated 3 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 3 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆137Updated 4 months ago
- Python library for handling audio datasets.☆137Updated last year
- Command line tool to create corpora for Common Voice☆75Updated 10 months ago
- 🐸STT integration examples☆127Updated 2 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆207Updated 3 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…☆218Updated 2 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆101Updated 6 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- ☆75Updated 3 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning☆206Updated last week
- ☆257Updated 2 years ago
- Voice Activity Detection (VAD) using deep learning.☆196Updated 5 years ago
- Implementation of audio degradation processes☆102Updated 9 years ago
- Adapting your own Language Model for Kaldi☆63Updated 6 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆205Updated 9 months ago
- Advanced data structures for handling temporal segments with attached labels.☆111Updated 2 months ago
- Python bindings of WebRTC Audio Processing☆189Updated 7 months ago
- Onnx wrapper for espnet infrernce model☆162Updated 6 months ago
- An open-source speech separation and enhancement library☆211Updated 4 years ago
- Server framework for Kaldi ASR Toolkit☆97Updated last year
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆154Updated 5 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆28Updated 10 months ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 4 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago