MelGAN implementation with Multi-Band and Full Band supports...
☆62Aug 27, 2020Updated 5 years ago
Alternatives and similar repositories for melgan
Users that are interested in melgan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆49Mar 10, 2021Updated 5 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Nov 29, 2020Updated 5 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- A pytroch implementation of the FB-MelGAN☆90May 26, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,637Apr 22, 2024Updated last year
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆1,039Aug 28, 2023Updated 2 years ago
- Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".☆14Jul 6, 2020Updated 5 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆649Oct 3, 2020Updated 5 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 4 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Aug 17, 2020Updated 5 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆112Aug 26, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆84May 23, 2023Updated 2 years ago
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆707Jul 12, 2022Updated 3 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆211May 30, 2025Updated 10 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- 基于随机森林和条件随机场的中文韵律预测模型☆28Jul 25, 2024Updated last year
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆321Jul 25, 2024Updated last year
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆29Jul 25, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆198May 3, 2024Updated last year
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆160Jun 5, 2025Updated 10 months ago
- Gaussian Mixture VAE Tacotron☆54Jul 6, 2023Updated 2 years ago
- ☆261Dec 8, 2022Updated 3 years ago
- A chinese singing voice dataset, professional male singer, 105 songs, 132 minutes☆11Oct 19, 2023Updated 2 years ago
- The code for aishell-3 baseline acoustic model☆69Nov 30, 2020Updated 5 years ago
- ☆64May 23, 2022Updated 3 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Sep 28, 2020Updated 5 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆62Jun 8, 2021Updated 4 years ago
- ☆255Jul 6, 2023Updated 2 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆201Sep 4, 2022Updated 3 years ago
- This repository is a collection of TTS Models in TFLite☆200Feb 12, 2021Updated 5 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Chinese text normalization for speech processing☆728Mar 18, 2023Updated 3 years ago
- This repository contains the scripts to use CURRENNT☆66Jun 15, 2020Updated 5 years ago