This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. The repository makes it possible to recreate the dataset automatically and to add more speakers to the processing pipeline.
☆36 · Mar 31, 2023 · Updated 2 years ago
Alternatives and similar repositories for HUI-Audio-Corpus-German
Users who are interested in HUI-Audio-Corpus-German are comparing it to the repositories listed below.
- check your data, before you wreck your model · ☆16 · Aug 11, 2022 · Updated 3 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to… · ☆36 · Jan 16, 2021 · Updated 5 years ago
- Java Bindings for the C++ library DeepSpeech · ☆10 · Jun 4, 2020 · Updated 5 years ago
- ☆15 · Jun 4, 2021 · Updated 4 years ago
- ☆61 · Nov 4, 2023 · Updated 2 years ago
- phone inventory library · ☆17 · May 15, 2023 · Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler · ☆23 · Mar 21, 2021 · Updated 5 years ago
- Repository for Accent Recognition (Hackathon @SLT2022) · ☆38 · May 12, 2024 · Updated last year
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models" · ☆48 · Mar 25, 2022 · Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment · ☆16 · Apr 13, 2022 · Updated 3 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one · ☆26 · Aug 5, 2024 · Updated last year
- ☆67 · Aug 16, 2023 · Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder · ☆122 · Jul 14, 2022 · Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training · ☆147 · Aug 22, 2022 · Updated 3 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition) · ☆23 · Nov 12, 2025 · Updated 4 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs · ☆16 · Jul 19, 2023 · Updated 2 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model · ☆67 · Jan 7, 2023 · Updated 3 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts · ☆61 · Feb 2, 2023 · Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. · ☆27 · Feb 15, 2024 · Updated 2 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, … · ☆291 · Apr 6, 2023 · Updated 2 years ago
- ☆20 · Jul 22, 2022 · Updated 3 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages · ☆174 · Jun 9, 2023 · Updated 2 years ago
- ☆55 · Jan 13, 2023 · Updated 3 years ago
- ☆163 · Sep 19, 2022 · Updated 3 years ago
- ICASSP 2023 Accepted · ☆190 · May 6, 2024 · Updated last year
- PyTorch model for contextless phoneme prediction from speech audio · ☆32 · Oct 30, 2025 · Updated 4 months ago
- PyTorch implementation of Miipher-2 [2025], which is a speech restoration model by Google DeepMind · ☆64 · Sep 22, 2025 · Updated 5 months ago
- Unofficial implementation of miipher · ☆135 · Apr 19, 2024 · Updated last year
- This is a PyTorch implementation of Google's Non-attentive Tacotron. · ☆57 · Dec 21, 2022 · Updated 3 years ago
- Audio Splitter provides a user-friendly solution for splitting audio files based on silence detection. · ☆18 · May 28, 2023 · Updated 2 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering · ☆24 · Oct 19, 2023 · Updated 2 years ago
- Script to train a German n-gram Language Model on articles of Wikipedia · ☆14 · Oct 20, 2018 · Updated 7 years ago
- NeMo text processing for ASR and TTS · ☆443 · Mar 13, 2026 · Updated last week
- ☆82 · Jan 22, 2025 · Updated last year
- Generate synthetic wind noise signals based on a wind speed profile (Python) · ☆49 · Apr 23, 2024 · Updated last year
- The YouTube Text-To-Speech dataset comprises waveform audio extracted from YouTube videos alongside their English transcriptions · ☆52 · Apr 1, 2021 · Updated 4 years ago
- scripts for working with open.bible data · ☆26 · Jan 24, 2022 · Updated 4 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition · ☆28 · Dec 16, 2022 · Updated 3 years ago
- Open Source Crimean Tatar Text-to-Speech datasets · ☆14 · Feb 23, 2025 · Updated last year