iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repository it is possible to automatically recreate the dataset. It is also possible to add more speakers to the processing pipeline.
☆30Updated last year
Alternatives and similar repositories for HUI-Audio-Corpus-German:
Users that are interested in HUI-Audio-Corpus-German are comparing it to the libraries listed below
- ☆23Updated 9 months ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆62Updated 11 months ago
- ☆50Updated 4 months ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆41Updated 4 years ago
- ☆31Updated 2 years ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆16Updated 4 years ago
- ☆69Updated last month
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- multilingual speech aligner☆72Updated last year
- ☆25Updated 6 months ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆29Updated 3 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆69Updated 3 months ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆86Updated 8 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 3 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆48Updated 2 years ago
- ☆63Updated last year
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆77Updated 2 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆51Updated 6 months ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆74Updated last year
- Alignment files of LibriTTS.☆61Updated 4 years ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆53Updated 3 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆40Updated last year
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆34Updated 8 months ago
- Prosody and Pronunciation Modification Network☆49Updated this week
- ☆64Updated last year
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆50Updated this week