Joovvhan / MelNet
PyTorch implementation of MelNet
☆10 Updated 5 years ago
Related projects
Alternatives and complementary repositories for MelNet
- Anonymous ICLR Submission ☆14 Updated 5 years ago
- ESPnet-TTS Audio Sample HP ☆21 Updated 5 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing ☆51 Updated 5 years ago
- Two different PyTorch implementations of Inverse-STFT for discussion at https://github.com/keunwoochoi/torchaudio-contrib/issues/27 ☆9 Updated 4 years ago
- PyTorch Implementation of WaveNODE ☆64 Updated 4 years ago
- PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020) ☆64 Updated 3 years ago
- TensorFlow Implementation of "Theory and Experiments on Vector Quantized Autoencoders" ☆14 Updated 5 years ago
- Network specification and demo ☆35 Updated 7 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks". ☆13 Updated last year
- Contrastive Language-Audio Pretraining ☆15 Updated 3 years ago
- PyTorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A… ☆23 Updated 5 years ago
- The training code for the 4th place model at MDX 2021 leaderboard A. ☆34 Updated 3 years ago
- Source code for "Towards a Deeper Understanding of Adversarial Losses under a Discriminative Adversarial Network Setting" ☆42 Updated 2 years ago
- Tensor2tensor experiment with SpecAugment ☆47 Updated 5 years ago
- VoxSRC Challenge ☆31 Updated 5 years ago
- PyTorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp… ☆28 Updated 5 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech" ☆51 Updated 4 years ago
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch). ☆15 Updated 3 years ago
- Code accompanying ML4MD ICML 2020 paper - "Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance". ☆29 Updated 4 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation ☆14 Updated 4 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning ☆45 Updated 3 years ago
- Compressed version of Tacotron 2 using Tensor Train + WaveGlow. ☆22 Updated 4 years ago
- An implementation of the Prism layer (https://arxiv.org/abs/2011.04823) ☆12 Updated 4 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System ☆35 Updated 4 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck ☆14 Updated 4 years ago
- Custom CUDA kernel for {2, 3}d relative attention with PyTorch wrapper ☆43 Updated 4 years ago
- Representation learning for NLP @ JSALT19 ☆36 Updated 4 years ago