iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repository it is possible to automatically recreate the dataset. It is also possible to add more speakers to the processing pipeline.
☆26Updated last year
Related projects ⓘ
Alternatives and complementary repositories for HUI-Audio-Corpus-German
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆38Updated 2 months ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆16Updated 4 years ago
- multilingual speech aligner☆71Updated 11 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆61Updated 7 months ago
- ☆25Updated 3 months ago
- Objective metrics used in several text-to-speech (TTS) papers.☆46Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆34Updated last year
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆70Updated last year
- ☆45Updated last week
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆37Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 2 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆46Updated 5 months ago
- CMU multilingual speech repository☆31Updated 2 years ago
- ☆62Updated last year
- Reference-aware automatic speech evaluation toolkit☆106Updated 8 months ago
- UTokyo-SaruLab MOS Prediction System☆83Updated this week
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 4 years ago
- A list of papers for child ASR☆26Updated last month
- Clustering-based methods for overlapping diarization☆68Updated 9 months ago
- ☆62Updated 6 months ago
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆61Updated 2 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆73Updated last year
- ☆31Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆10Updated last month
- ☆44Updated last year