silenterus / deepspeech-cleaner
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
☆47Updated last year
Alternatives and similar repositories for deepspeech-cleaner:
Users that are interested in deepspeech-cleaner are comparing it to the libraries listed below
- Jupyter Notebooks for creating Speech datasets☆46Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 8 months ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 5 years ago
- Linguistic processing for Common Voice☆52Updated last year
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆73Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆25Updated 2 years ago
- Command line tool to create corpora for Common Voice☆75Updated 8 months ago
- Adapting your own Language Model for Kaldi☆64Updated 6 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 5 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- ☆34Updated 4 months ago
- Multilingual Grapheme to Phoneme☆49Updated 8 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆36Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- ☆74Updated 3 years ago
- Simple Diarization model☆46Updated last year
- Silence detection in audio stream using webrtcvad☆46Updated last year
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 2 years ago
- Deep Convolution Text to Speech☆35Updated 6 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 3 years ago
- Tools to create your own voice dataset for TTS training☆65Updated 4 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Updated 3 years ago