The official source code of UniAudio
☆95Mar 29, 2024Updated last year
Alternatives and similar repositories for UniAudio
Users that are interested in UniAudio are comparing it to the libraries listed below
Sorting:
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- ☆54Mar 2, 2023Updated 3 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Nov 16, 2025Updated 3 months ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- Implementation of SpatialCodec.☆69Sep 23, 2023Updated 2 years ago
- ☆49Apr 1, 2025Updated 11 months ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- Official implementation of the source-filter HiFiGAN vocoder☆268Jul 29, 2023Updated 2 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆87Dec 20, 2024Updated last year
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆223Oct 20, 2023Updated 2 years ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆434Sep 13, 2024Updated last year
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS