iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. The repository makes it possible to recreate the dataset automatically and to add more speakers to the processing pipeline.
☆34 Updated 2 years ago
Alternatives and similar repositories for HUI-Audio-Corpus-German
Users who are interested in HUI-Audio-Corpus-German are comparing it to the repositories listed below
- ☆80 Updated 5 months ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL ☆101 Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆68 Updated 3 weeks ago
- multilingual speech aligner ☆76 Updated 2 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆65 Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆77 Updated 2 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization. ☆55 Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆71 Updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆122 Updated 3 years ago
- A sequence-to-sequence voice conversion toolkit. ☆108 Updated last year
- SelfRemaster: SSL Speech Restoration ☆93 Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 Updated 2 years ago
- High-Fidelity Neural Phonetic Posteriorgrams ☆121 Updated 11 months ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit… ☆83 Updated 3 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset. ☆45 Updated 6 years ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac… ☆78 Updated 4 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations. ☆36 Updated last year
- Training code and dataset cleansing with Sidon ☆73 Updated 2 weeks ago
- ☆64 Updated last year
- ☆31 Updated 2 years ago
- ☆26 Updated last year
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" ☆134 Updated 2 years ago
- ☆82 Updated last year
- Unofficial implementation of wavenext vocoder ☆55 Updated last year
- ☆26 Updated last year
- ☆59 Updated 3 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995 ☆78 Updated last year
- Alignment files of LibriTTS. ☆67 Updated 5 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN… ☆74 Updated 3 years ago
- ☆44 Updated last year