HudsonHuang/waveglow_vocoder

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HudsonHuang/waveglow_vocoder)

HudsonHuang / waveglow_vocoder

A vocoder that can convert audio to Mel-Spectrogram and reverse with WaveGlow, with GPU.

☆16

Alternatives and similar repositories for waveglow_vocoder

Users that are interested in waveglow_vocoder are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rhasspy / wav2mel
View on GitHub
Transform audio files into mel spectrograms for text-to-speech model training
☆12Aug 25, 2021Updated 4 years ago
b04901014 / ISGAN
View on GitHub
☆21Nov 1, 2018Updated 7 years ago
jonmmease / jupyterlab_delux
View on GitHub
Proof-of-concept of a conda package that includes JupyterLab with preinstalled extensions.
☆19Nov 18, 2020Updated 5 years ago
YoavRamon / Speech-Recognition-Israel
View on GitHub
The repository for Speech Recognition Israel meetup group. It is used to material collection and sharing.
☆13Jul 12, 2020Updated 6 years ago
horvathandris / dime
View on GitHub
An ISO-4217 currency library for Gleam
☆13Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gulico / Plants_vs_Zombies
View on GitHub
植物大战僵尸纯c++&SDK
☆10Jun 28, 2020Updated 6 years ago
cjerry1243 / TransferLearning-CLVC
View on GitHub
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
☆40Oct 22, 2022Updated 3 years ago
qiuqiangkong / mini_music_tagging
View on GitHub
☆13Jul 14, 2024Updated 2 years ago
ariacat3366 / pytorch-StarGAN-VC2-implementation
View on GitHub
This is a pytorch implementation of StarGAN-VC2.
☆13Dec 17, 2019Updated 6 years ago
foamliu / Speaker-Embeddings
View on GitHub
PyTorch implementation of a self-attentive speaker embedding
☆17Sep 24, 2019Updated 6 years ago
ariacat3366 / ACVAE-VC
View on GitHub
☆22Jan 15, 2019Updated 7 years ago
anonymous84654 / RAVE_anonymous
View on GitHub
☆14Mar 20, 2022Updated 4 years ago
Asthestarsfalll / Symbolic-Music-Genre-Transfer-with-CycleGAN-for-pytorch
View on GitHub
The PyTorch Implement of Symbolic Music Genre Transfer with CycleGAN
☆10Jan 8, 2022Updated 4 years ago
boblsturm / aimusicgenerationchallenge2023
View on GitHub
☆18Aug 12, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lgraesser / MultimodalGame
View on GitHub
☆24Sep 13, 2018Updated 7 years ago
itsuki8914 / Voice-morphing-RelGAN
View on GitHub
A implementation voice morphing using relgan with tensorflow
☆25Mar 24, 2023Updated 3 years ago
moiseshorta / MelSpecVAE
View on GitHub
Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis
☆146Dec 12, 2021Updated 4 years ago
narad / amp-space
View on GitHub
Scripts, Files, and Resources for Constructing a Large-scale Dataset of Blackbox Effects for Timbre Transfer
☆16Feb 4, 2023Updated 3 years ago
DCASE2023-Task7-Foley-Sound-Synthesis / dcase2023_task7_baseline
View on GitHub
☆32Apr 1, 2023Updated 3 years ago
bityangke / 3d-DenseNet
View on GitHub
3D Dense Connected Convolutional Network (3D-DenseNet for action recognition)
☆17Jun 24, 2017Updated 9 years ago
juhayna-zh / Awesome-Music-Generation-Papers
View on GitHub
Curated list of groundbreaking music generation research.
☆21Apr 24, 2026Updated 3 months ago
cegeme / iracema
View on GitHub
☆16Mar 25, 2023Updated 3 years ago
phiana / speech-style-transfer-vae-gan-tensorflow
View on GitHub
A TensorFlow implementation of a variational autoencoder-generative adversarial network (VAE-GAN) architecture for speech-to-speech style…
☆19Jan 17, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
inteljack / EL6183-Digital-Signal-Processing-Lab-2015-Fall
View on GitHub
☆23Apr 6, 2016Updated 10 years ago
ms-dot-k / Visual-Audio-Memory
View on GitHub
PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)
☆22Apr 11, 2022Updated 4 years ago
GANtastic3 / MaskCycleGAN-VC
View on GitHub
Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.
☆116Jun 6, 2021Updated 5 years ago
Hiroshiba / openjtalk-label-getter
View on GitHub
☆10Dec 10, 2021Updated 4 years ago
wangzuzihan / AIGC-
View on GitHub
☆11Jan 13, 2023Updated 3 years ago
xavierfav / feature-comparison-clustering
View on GitHub
Comparing Audio Features for Unsupervised Sound Classification
☆10Jun 22, 2022Updated 4 years ago
ben-hayes / timbre-dissimilarity-metrics
View on GitHub
A collection of metrics for evaluating timbre dissimilarity using the TorchMetrics API
☆32Dec 30, 2021Updated 4 years ago
stefantaubert / mean-opinion-score
View on GitHub
Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings acc…
☆24Jan 31, 2025Updated last year
mdx-tutorial / mdx-tutorial.github.io
View on GitHub
Tutorial covering Open Source tools for Source Separation.
☆15Nov 12, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
manymuch / Natural-Noise-Generator
View on GitHub
☆10Aug 3, 2019Updated 6 years ago
mdx-workshop / mdx-submissions21
View on GitHub
Music Demixing Challenge Submission Repo
☆16Sep 8, 2023Updated 2 years ago
GPUPhobia / vocal-mask
View on GitHub
☆12May 1, 2019Updated 7 years ago
david-gimeno / tailored-avsr
View on GitHub
Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
☆15Feb 24, 2025Updated last year
kan-bayashi / Taco2withBERT
View on GitHub
Tacotron2 with BERT examples
☆10Jul 8, 2019Updated 7 years ago
muhdhuz / audio2spec
View on GitHub
Scripts to convert audio files to spectrograms and back
☆12Nov 23, 2017Updated 8 years ago
zassou65535 / WaveGAN
View on GitHub
WaveGANによる音声生成器
☆13Feb 9, 2024Updated 2 years ago