pguyot / zamia-speech
Open tools and data for cloudless automatic speech recognition
☆10Updated 5 years ago
Alternatives and similar repositories for zamia-speech:
Users that are interested in zamia-speech are comparing it to the libraries listed below
- Classify audio with neural nets on embedded systems like the Raspberry Pi☆84Updated 9 months ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆34Updated 4 years ago
- Crawling and creating a German language model resource☆19Updated 2 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Updated 3 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated last year
- SubER - Subtitle Edit Rate☆22Updated 4 months ago
- ☆9Updated 3 months ago
- ☆11Updated last year
- XCORE-VOICE Solution☆12Updated last month
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆39Updated last year
- This is the experimental description of MnTTS2.☆9Updated 9 months ago
- Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and …☆20Updated 2 years ago
- Open Source Wearable Microphone Array Glasses for Multi-Speaker Speech Recognition☆16Updated 2 years ago
- David Talkin's pitch tracker, get_f0, modified to be a stand-alone binary using dpwelib for sound I/O☆12Updated 11 years ago
- ☆8Updated 3 years ago
- An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs☆37Updated 10 months ago
- ☆11Updated 2 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 3 years ago
- An even smaller speech recognizer / force aligner☆32Updated last month
- Voice Framework☆14Updated 2 months ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- Russian phonetical transcription☆9Updated last year
- Transfer learning approach to pronunciation scoring☆10Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆36Updated 2 years ago
- A semi-supervised sequence-to-sequence ASR☆10Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆16Updated 2 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Open models for Coqui STT☆127Updated last year
- Evaluation of STT models for german language☆15Updated 2 years ago
- Simple Kaldi recipe for forced alignment☆10Updated last year