iisys-hof / HUI-Audio-Corpus-GermanLinks
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repository it is possible to automatically recreate the dataset. It is also possible to add more speakers to the processing pipeline.
☆31Updated 2 years ago
Alternatives and similar repositories for HUI-Audio-Corpus-German
Users that are interested in HUI-Audio-Corpus-German are comparing it to the libraries listed below
Sorting:
- ☆23Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆52Updated 10 months ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆76Updated last year
- multilingual speech aligner☆74Updated last year
- ☆37Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆43Updated 4 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆67Updated last month
- ☆25Updated 10 months ago
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Updated 11 months ago
- Alignment files of LibriTTS.☆62Updated 5 years ago
- ☆54Updated 7 months ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆57Updated 3 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆48Updated last year
- Implementation of the AlignTTS☆76Updated last year
- ☆78Updated 5 months ago
- ☆30Updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆80Updated 2 years ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆101Updated 11 months ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆53Updated 2 years ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆76Updated 6 months ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- ☆87Updated 2 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆49Updated last week