MelGAN implementation with Multi-Band and Full Band supports...
☆62Aug 27, 2020Updated 5 years ago
Alternatives and similar repositories for melgan
Users that are interested in melgan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆49Mar 10, 2021Updated 5 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Nov 29, 2020Updated 5 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- A pytroch implementation of the FB-MelGAN☆90May 26, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,641Apr 22, 2024Updated 2 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆1,040Aug 28, 2023Updated 2 years ago
- Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".☆14Jul 6, 2020Updated 5 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆650Oct 3, 2020Updated 5 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 4 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Aug 17, 2020Updated 5 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆112Aug 26, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆84May 23, 2023Updated 2 years ago
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆712Jul 12, 2022Updated 3 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆211May 30, 2025Updated 11 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- 基于随机森林和条件随机场的中文韵律预测模型☆28Jul 25, 2024Updated last year
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆321Jul 25, 2024Updated last year
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆29Jul 25, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆199May 3, 2024Updated 2 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆159Jun 5, 2025Updated 11 months ago
- Gaussian Mixture VAE Tacotron☆54Jul 6, 2023Updated 2 years ago
- ☆261Dec 8, 2022Updated 3 years ago
- A chinese singing voice dataset, professional male singer, 105 songs, 132 minutes☆11Oct 19, 2023Updated 2 years ago
- The code for aishell-3 baseline acoustic model☆69Nov 30, 2020Updated 5 years ago
- ☆64May 23, 2022Updated 3 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Sep 28, 2020Updated 5 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆62Jun 8, 2021Updated 4 years ago
- ☆254Jul 6, 2023Updated 2 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆201Sep 4, 2022Updated 3 years ago
- This repository is a collection of TTS Models in TFLite☆200Feb 12, 2021Updated 5 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Chinese text normalization for speech processing☆728Mar 18, 2023Updated 3 years ago
- This repository contains the scripts to use CURRENNT☆66Jun 15, 2020Updated 5 years ago