muhdhuz / Audio_NeuralStyle
An implementation of Neural Style Transfer for Audio using Pytorch.
☆10Updated 7 years ago
Alternatives and similar repositories for Audio_NeuralStyle:
Users that are interested in Audio_NeuralStyle are comparing it to the libraries listed below
- Can Neural Networks reconstruct missing audio data? What about GANs?☆16Updated 5 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Updated 2 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Updated 4 years ago
- Code for Unconditional Audio Generation with GAN and Cycle Regularization☆75Updated 3 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆18Updated 2 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- Semi-supervised learning using teacher-student models for vocal melody extraction☆42Updated 3 years ago
- ☆18Updated 5 years ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 5 years ago
- ☆23Updated 3 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Updated last year
- The official PyTorch implementation of paper: An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmen…☆9Updated 3 years ago
- Real-time melgan based on cpu !!!☆13Updated 5 years ago
- CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion☆41Updated 5 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)☆22Updated 3 years ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆21Updated last year
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15Updated 2 years ago
- An unofficial implementation of the paper titled "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network".☆27Updated 4 years ago
- ☆87Updated 2 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 4 years ago
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"☆55Updated 4 months ago
- Generative adversarial context encoder for audio inpainting☆25Updated 3 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- Voice conversion (VC) investigation using three variants of VAE☆57Updated 5 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Updated 2 years ago
- Open source code for the paper 'Music Source Separation with Generative Flow'☆22Updated 2 years ago
- Implementation of the framework described in the paper Spectrogram Inpainting for Interactive Generation of Instrument Sounds published a…☆39Updated 2 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 4 years ago