MelGAN implementation with Multi-Band and Full Band supports...
☆62Aug 27, 2020Updated 5 years ago
Alternatives and similar repositories for melgan
Users that are interested in melgan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆49Mar 10, 2021Updated 5 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Nov 29, 2020Updated 5 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- A pytroch implementation of the FB-MelGAN☆90May 26, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,637Apr 22, 2024Updated last year
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆1,037Aug 28, 2023Updated 2 years ago
- Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".☆14Jul 6, 2020Updated 5 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆650Oct 3, 2020Updated 5 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 3 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Aug 17, 2020Updated 5 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆112Aug 26, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆84May 23, 2023Updated 2 years ago
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆704Jul 12, 2022Updated 3 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆211May 30, 2025Updated 9 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- 基于随机森林和条件随机场的中文韵律预测模型☆28Jul 25, 2024Updated last year
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆321Jul 25, 2024Updated last year
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆29Jul 25, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆160Jun 5, 2025Updated 9 months ago
- ☆198May 3, 2024Updated last year
- Gaussian Mixture VAE Tacotron☆54Jul 6, 2023Updated 2 years ago
- ☆262Dec 8, 2022Updated 3 years ago
- The code for aishell-3 baseline acoustic model☆69Nov 30, 2020Updated 5 years ago
- A chinese singing voice dataset, professional male singer, 105 songs, 132 minutes☆11Oct 19, 2023Updated 2 years ago
- ☆64May 23, 2022Updated 3 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Sep 28, 2020Updated 5 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆62Jun 8, 2021Updated 4 years ago
- ☆255Jul 6, 2023Updated 2 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆203Sep 4, 2022Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- This repository is a collection of TTS Models in TFLite☆200Feb 12, 2021Updated 5 years ago
- Chinese text normalization for speech processing☆722Mar 18, 2023Updated 3 years ago
- This repository contains the scripts to use CURRENNT☆66Jun 15, 2020Updated 5 years ago