german-asr / megsView external linksLinks
A merged version of multiple open-source German speech datasets.
☆34May 3, 2024Updated last year
Alternatives and similar repositories for megs
Users that are interested in megs are comparing it to the libraries listed below
Sorting:
- Scripts for training Kaldi for German speech recognition (ASR).☆26Feb 11, 2021Updated 5 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16May 9, 2021Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- Repository for Accent Recognition (Hackathon @SLT2022)☆38May 12, 2024Updated last year
- ☆12Feb 9, 2021Updated 5 years ago
- ☆11Sep 5, 2025Updated 5 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆34Mar 31, 2023Updated 2 years ago
- ☆14Feb 9, 2023Updated 3 years ago
- ☆14Aug 1, 2025Updated 6 months ago
- Crawling and creating a German language model resource☆18Aug 23, 2022Updated 3 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated last year
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 3 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- ☆16Apr 2, 2021Updated 4 years ago
- Repository for multilingual speech data resources for native languages of Zambia.☆20Oct 9, 2024Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Feb 18, 2022Updated 3 years ago
- ☆46Nov 2, 2025Updated 3 months ago
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license s…☆695Feb 2, 2026Updated 2 weeks ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆18Aug 9, 2023Updated 2 years ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated 8 months ago
- ☆19Jun 28, 2022Updated 3 years ago
- 🔁 Async JSON-RPC 2.0 protocol + server powered by asyncio & py35+. json-rpc successor.☆20Jul 21, 2023Updated 2 years ago
- Acoustic Neighbor Embeddings☆29Jul 13, 2025Updated 7 months ago
- Automatic Speech Recognition (ASR) - German☆22Aug 26, 2019Updated 6 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆26Jan 11, 2022Updated 4 years ago
- A collection of all our phonemeizers for dataset construction and inference☆27Feb 21, 2025Updated 11 months ago
- German Tacotron 2 and Multi-band MelGAN in TensorFlow with TF Lite inference support☆25Jun 7, 2021Updated 4 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Jun 24, 2021Updated 4 years ago
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated 2 weeks ago
- ☆25Jun 14, 2022Updated 3 years ago
- ☆29Aug 20, 2023Updated 2 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- Linguistic processing for Common Voice☆58Jan 18, 2024Updated 2 years ago
- Articulatory (text-to-) speech synthesis for Python☆28May 7, 2025Updated 9 months ago
- dnsmasq docker image, fully configurable through ENV☆32Feb 1, 2026Updated 2 weeks ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Mar 18, 2021Updated 4 years ago
- Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット☆33Apr 3, 2022Updated 3 years ago