german-asr / megs
A merged version of multiple open-source German speech datasets.
☆31Updated 9 months ago
Alternatives and similar repositories for megs:
Users that are interested in megs are comparing it to the libraries listed below
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆74Updated 3 years ago
- Linguistic processing for Common Voice☆53Updated last year
- Phonetically-Oriented Word Error Rate☆33Updated 5 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆20Updated 11 months ago
- ☆34Updated 5 months ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆38Updated 2 years ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆64Updated 11 months ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆40Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆17Updated 2 years ago
- ☆43Updated 2 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- ☆43Updated 2 years ago
- Word Error Rate Estimation☆11Updated 4 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆24Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆21Updated 5 months ago
- Example code for a neural transducer model.☆61Updated last year
- ☆56Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆62Updated 11 months ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 4 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 3 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆14Updated 4 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Updated 2 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆36Updated last year
- ☆38Updated 3 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago
- ☆11Updated last year
- ☆11Updated 8 months ago
- Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.☆14Updated last year