A merged version of multiple open-source German speech datasets.
☆34May 3, 2024Updated last year
Alternatives and similar repositories for megs
Users that are interested in megs are comparing it to the libraries listed below
Sorting:
- Scripts for training Kaldi for German speech recognition (ASR).☆27Feb 11, 2021Updated 5 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16May 9, 2021Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- Repository for Accent Recognition (Hackathon @SLT2022)☆38May 12, 2024Updated last year
- ☆12Feb 9, 2021Updated 5 years ago
- ☆11Sep 5, 2025Updated 6 months ago
- ☆14Feb 9, 2023Updated 3 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Crawling and creating a German language model resource☆18Aug 23, 2022Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆36Mar 31, 2023Updated 2 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 3 years ago
- ☆15Aug 1, 2025Updated 7 months ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Feb 18, 2022Updated 4 years ago
- Repository for multilingual speech data resources for native languages of Zambia.☆20Oct 9, 2024Updated last year
- ☆46Nov 2, 2025Updated 4 months ago
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license s…☆704Feb 2, 2026Updated last month
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆21Aug 9, 2023Updated 2 years ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated 8 months ago
- ☆19Jun 28, 2022Updated 3 years ago
- Acoustic Neighbor Embeddings☆28Jul 13, 2025Updated 7 months ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆26Jan 11, 2022Updated 4 years ago
- German Tacotron 2 and Multi-band MelGAN in TensorFlow with TF Lite inference support☆26Jun 7, 2021Updated 4 years ago
- A collection of all our phonemeizers for dataset construction and inference☆28Feb 21, 2025Updated last year
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Jun 24, 2021Updated 4 years ago
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated last month
- ☆25Jun 14, 2022Updated 3 years ago
- ☆29Aug 20, 2023Updated 2 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- Linguistic processing for Common Voice☆58Jan 18, 2024Updated 2 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Mar 18, 2021Updated 4 years ago
- Articulatory (text-to-) speech synthesis for Python☆29May 7, 2025Updated 10 months ago
- Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット☆34Apr 3, 2022Updated 3 years ago
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- ☆28Dec 14, 2021Updated 4 years ago
- IPA Phonemizer/Dephonemizer for 140 human languages☆55Feb 11, 2026Updated 3 weeks ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆43Feb 24, 2026Updated last week