iisys-hof / HUI-Audio-Corpus-GermanLinks
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repository it is possible to automatically recreate the dataset. It is also possible to add more speakers to the processing pipeline.
☆33Updated 2 years ago
Alternatives and similar repositories for HUI-Audio-Corpus-German
Users that are interested in HUI-Audio-Corpus-German are comparing it to the libraries listed below
Sorting:
- ☆80Updated 3 months ago
- multilingual speech aligner☆77Updated 2 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆62Updated last year
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆101Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Updated 3 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆133Updated 2 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Updated last year
- ☆64Updated last year
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 3 years ago
- ☆58Updated last month
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Updated 4 years ago
- ☆82Updated 10 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Updated 6 months ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 3 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆52Updated last year
- SelfRemaster: SSL Speech Restoration☆93Updated last year
- A sequence-to-sequence voice conversion toolkit.☆105Updated last year
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆83Updated 2 years ago
- Alignment files of LibriTTS.☆65Updated 5 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆44Updated 4 years ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆80Updated 11 months ago
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆51Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆45Updated 2 years ago
- High-Fidelity Neural Phonetic Posteriorgrams☆120Updated 9 months ago
- Prosodic Speech Segmentation with Transformers☆26Updated last year
- This is the M-AILABS Speech Dataset☆90Updated last year
- ☆43Updated last year