This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. The repository can be used to recreate the dataset automatically and to add more speakers to the processing pipeline.
☆36 · Mar 31, 2023 · Updated 3 years ago
Alternatives and similar repositories for HUI-Audio-Corpus-German
Users that are interested in HUI-Audio-Corpus-German are comparing it to the libraries listed below.
- check your data, before you wreck your model ☆16 · Aug 11, 2022 · Updated 3 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to… ☆36 · Jan 16, 2021 · Updated 5 years ago
- Java Bindings for the C++ library DeepSpeech ☆10 · Jun 4, 2020 · Updated 5 years ago
- ☆15 · Jun 4, 2021 · Updated 4 years ago
- ☆61 · Nov 4, 2023 · Updated 2 years ago
- phone inventory library ☆17 · May 15, 2023 · Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆23 · Mar 21, 2021 · Updated 5 years ago
- Repository for Accent Recognition (Hackathon @SLT2022) ☆38 · May 12, 2024 · Updated last year
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models" ☆48 · Mar 25, 2022 · Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 · Apr 13, 2022 · Updated 3 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- ☆68 · Aug 16, 2023 · Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆122 · Jul 14, 2022 · Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆148 · Aug 22, 2022 · Updated 3 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition) ☆23 · Nov 12, 2025 · Updated 4 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs ☆16 · Jul 19, 2023 · Updated 2 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model ☆66 · Jan 7, 2023 · Updated 3 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆61 · Feb 2, 2023 · Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Feb 15, 2024 · Updated 2 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, … ☆291 · Apr 6, 2023 · Updated 3 years ago
- ☆20 · Jul 22, 2022 · Updated 3 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆174 · Jun 9, 2023 · Updated 2 years ago
- ☆55 · Jan 13, 2023 · Updated 3 years ago
- ☆163 · Sep 19, 2022 · Updated 3 years ago
- ICASSP 2023 Accepted ☆190 · May 6, 2024 · Updated last year
- PyTorch model for contextless phoneme prediction from speech audio ☆32 · Oct 30, 2025 · Updated 5 months ago
- Unofficial implementation of miipher ☆136 · Apr 19, 2024 · Updated last year
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind ☆64 · Sep 22, 2025 · Updated 6 months ago
- This is a PyTorch implementation of Google's Non-attentive Tacotron. ☆57 · Dec 21, 2022 · Updated 3 years ago
- Audio Splitter provides a user-friendly solution for splitting audio files based on silence detection. ☆18 · May 28, 2023 · Updated 2 years ago
- Script to train a German n-gram Language Model on articles of Wikipedia ☆14 · Oct 20, 2018 · Updated 7 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering ☆24 · Oct 19, 2023 · Updated 2 years ago
- NeMo text processing for ASR and TTS ☆450 · Updated this week
- ☆82 · Jan 22, 2025 · Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions ☆52 · Apr 1, 2021 · Updated 5 years ago
- scripts for working with open.bible data ☆26 · Jan 24, 2022 · Updated 4 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 · Dec 16, 2022 · Updated 3 years ago
- Generate synthetic wind noise signals based on a wind speed profile (Python) ☆51 · Apr 23, 2024 · Updated last year
- Open Source Crimean Tatar Text-to-Speech datasets ☆14 · Feb 23, 2025 · Updated last year