iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. The repository makes it possible to recreate the dataset automatically and to add more speakers to the processing pipeline.
☆33 Updated 2 years ago
Alternatives and similar repositories for HUI-Audio-Corpus-German
Users who are interested in HUI-Audio-Corpus-German are comparing it to the repositories listed below.
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆65 Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆71 Updated 3 years ago
- multilingual speech aligner ☆77 Updated 2 years ago
- ☆80 Updated 5 months ago
- ☆64 Updated last year
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL ☆101 Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆122 Updated 3 years ago
- A sequence-to-sequence voice conversion toolkit. ☆106 Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆77 Updated 2 years ago
- ☆26 Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆68 Updated this week
- SelfRemaster: SSL Speech Restoration ☆93 Updated 2 years ago
- Unofficial implementation of wavenext vocoder ☆53 Updated last year
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization. ☆53 Updated last year
- ☆59 Updated 2 months ago
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS). ☆51 Updated last year
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit… ☆83 Updated 3 years ago
- Prosodic Speech Segmentation with Transformers ☆26 Updated last year
- ☆70 Updated 2 years ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac… ☆72 Updated 3 months ago
- ☆82 Updated 11 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations. ☆36 Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams ☆121 Updated 10 months ago
- ☆37 Updated last year
- ☆44 Updated last year
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models ☆58 Updated 6 months ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆55 Updated 4 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset. ☆44 Updated 6 years ago
- [Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport" ☆43 Updated 3 months ago