This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repository it is possible to automatically recreate the dataset. It is also possible to add more speakers to the processing pipeline.
β35Mar 31, 2023Updated 3 years ago
Alternatives and similar repositories for HUI-Audio-Corpus-German
Users that are interested in HUI-Audio-Corpus-German are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π« check your data, before you wreck your modelβ16Aug 11, 2022Updated 3 years ago
- Using Tacotron2 to do Cherokee Text to Speechβ10May 10, 2022Updated 4 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used toβ¦β38Jan 16, 2021Updated 5 years ago
- Java Bindings for the C++ library DeepSpeechβ10Jun 4, 2020Updated 6 years ago
- β15Jun 4, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β61Nov 4, 2023Updated 2 years ago
- phone inventory libraryβ17May 15, 2023Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerβ23Mar 21, 2021Updated 5 years ago
- Repository for Accent Recognition (Hackathon @SLT2022)β41May 12, 2024Updated 2 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme modelsβ48Mar 25, 2022Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessmentβ16Apr 13, 2022Updated 4 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into oneβ26Aug 5, 2024Updated last year
- β68Aug 16, 2023Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoderβ122Jul 14, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Deep Neural Pitch Extractor for Voice Conversion and TTS Trainingβ151Aug 22, 2022Updated 3 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)β23Nov 12, 2025Updated 6 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANsβ16Jul 19, 2023Updated 2 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS modelβ69Jan 7, 2023Updated 3 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_ttsβ61Feb 2, 2023Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Feb 15, 2024Updated 2 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, β¦β293Apr 6, 2023Updated 3 years ago
- β22Jul 22, 2022Updated 3 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languagesβ174Jun 9, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β55Jan 13, 2023Updated 3 years ago
- β163Sep 19, 2022Updated 3 years ago
- ICASSP 2023 Acceptedβ191May 6, 2024Updated 2 years ago
- pytorch model for contexless-phoneme prediction from speech audioβ32Oct 30, 2025Updated 7 months ago
- Unofficial implementation of miipherβ136Apr 19, 2024Updated 2 years ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMindβ66Sep 22, 2025Updated 8 months ago
- This is Pytorch Implementation of Google's Non-attentive Tacotron.β57Dec 21, 2022Updated 3 years ago
- Audio Splitter provides a user-friendly solution for splitting audio files based on silence detection.β18May 28, 2023Updated 3 years ago
- Script to train a German n-gram Language Model on articles of Wikipediaβ14Oct 20, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filteringβ23Oct 19, 2023Updated 2 years ago
- NeMo text processing for ASR and TTSβ466Updated this week
- β81Jan 22, 2025Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptionsβ52Apr 1, 2021Updated 5 years ago
- scipts for working with open.bible dataβ26Jan 24, 2022Updated 4 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognitionβ28Dec 16, 2022Updated 3 years ago
- Generate synthetic wind noise signals based on a wind speed profile (Python)β51Apr 23, 2024Updated 2 years ago