iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repository it is possible to automatically recreate the dataset. It is also possible to add more speakers to the processing pipeline.
☆32 Updated 2 years ago
Alternatives and similar repositories for HUI-Audio-Corpus-German
Users who are interested in HUI-Audio-Corpus-German compare it to the repositories listed below.
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆53 Updated 11 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆70 Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆76 Updated 2 years ago
- multilingual speech aligner ☆75 Updated last year
- ☆80 Updated last year
- SelfRemaster: SSL Speech Restoration ☆89 Updated last year
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL ☆101 Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆67 Updated 2 months ago
- ☆63 Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆120 Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 Updated 2 years ago
- ☆78 Updated 6 months ago
- ☆26 Updated 11 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert ☆39 Updated 2 years ago
- Unofficial implementation of wavenext vocoder ☆48 Updated 11 months ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis ☆44 Updated 2 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN… ☆74 Updated 2 years ago
- ☆24 Updated last year
- This is the M-AILABS Speech Dataset ☆72 Updated 8 months ago
- ☆55 Updated 9 months ago
- High-Fidelity Neural Phonetic Posteriorgrams ☆112 Updated 5 months ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit… ☆81 Updated 2 years ago
- Alignment files of LibriTTS. ☆64 Updated 5 years ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995 ☆77 Updated 8 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations. ☆36 Updated last year
- ☆41 Updated 10 months ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac… ☆50 Updated 2 weeks ago
- A sequence-to-sequence voice conversion toolkit. ☆102 Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆43 Updated 4 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model ☆54 Updated last year