iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repository it is possible to automatically recreate the dataset. It is also possible to add more speakers to the processing pipeline.
☆30Updated 2 years ago
Alternatives and similar repositories for HUI-Audio-Corpus-German:
Users that are interested in HUI-Audio-Corpus-German are comparing it to the libraries listed below
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆42Updated 4 years ago
- ☆23Updated 10 months ago
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 3 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆64Updated last year
- multilingual speech aligner☆74Updated last year
- ☆51Updated 5 months ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- ☆25Updated 8 months ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- ☆65Updated last year
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆48Updated 3 years ago
- ☆45Updated 2 years ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆31Updated last week
- ☆62Updated 11 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆42Updated last year
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆54Updated last year
- ☆72Updated 3 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆76Updated 4 months ago
- ☆28Updated 11 months ago
- ☆30Updated 2 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 10 months ago
- Implementation of the AlignTTS☆76Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET)☆65Updated 5 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆50Updated 8 months ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆120Updated 2 years ago