speechcatcher-asr / speechcatcher-data
☆10Updated last week
Alternatives and similar repositories for speechcatcher-data:
Users that are interested in speechcatcher-data are comparing it to the libraries listed below
- ☆12Updated 2 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- ☆17Updated last year
- Word Error Rate Estimation☆11Updated 4 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 6 months ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated last year
- ☆17Updated 3 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- ☆20Updated 6 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆11Updated last year
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆20Updated 11 months ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated this week
- ☆39Updated last month
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆34Updated 5 years ago
- ☆34Updated 5 months ago
- phone inventory library☆16Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆17Updated 2 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆43Updated 3 years ago
- A semi-supervised sequence-to-sequence ASR☆10Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆38Updated 3 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆36Updated 3 years ago
- ☆11Updated 3 years ago
- ☆12Updated 3 weeks ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 4 years ago
- ☆11Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year