This is a PyTorch implementation of the neural style transfer algorithm, modified for audio.
☆85 Mar 16, 2022 Updated 3 years ago
Alternatives and similar repositories for Neural-Style-Transfer-Audio
Users that are interested in Neural-Style-Transfer-Audio are comparing it to the libraries listed below
- An implementation of Neural Style Transfer for Audio using Pytorch. ☆10 Dec 14, 2017 Updated 8 years ago
- Github repository for inzva-ai project Audio Style Transfer ☆56 Oct 13, 2018 Updated 7 years ago
- NIPS2017 "Time Domain Neural Audio Style Transfer" code repository ☆139 Apr 12, 2022 Updated 3 years ago
- Audio style transfer with shallow random parameters CNN. ☆406 Feb 19, 2025 Updated last year
- Torch implementation for audio neural style. ☆141 Feb 8, 2017 Updated 9 years ago
- Real-time melgan based on cpu !!! ☆13 Dec 3, 2019 Updated 6 years ago
- TensorFlow implementation for audio neural style. ☆450 Apr 23, 2022 Updated 3 years ago
- MSc AI Project on generative deep networks and neural style transfer for audio ☆63 May 18, 2017 Updated 8 years ago
- This is the implementation of the paper "VAW-GAN for Singing Voice Conversion with Non-parallel Training Data". ☆17 Aug 12, 2020 Updated 5 years ago
- Mel spectrum based on tacotron2 for melgan speech synthesis ☆15 Mar 24, 2023 Updated 2 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms ☆229 Apr 17, 2022 Updated 3 years ago
- Audio style transfer AI ☆155 Oct 20, 2024 Updated last year
- Reproducing PARALLEL-DATA-FREE VOICE CONVERSION USING CYCLE-CONSISTENT ADVERSARIAL NETWORKS (https://arxiv.org/pdf/1711.11293.pdf) ☆21 Jul 18, 2019 Updated 6 years ago
- ☆57 Apr 22, 2024 Updated last year
- A Text2Speech Engine built in Pytorch. ☆12 Dec 9, 2018 Updated 7 years ago
- Symbolic Music Genre Transfer with CycleGAN - Refactorization ☆36 Sep 13, 2021 Updated 4 years ago
- Application of OpenAI tools such as Whisper, DALL-E, and ChatGPT to generate album covers from audio ☆12 May 31, 2023 Updated 2 years ago
- ☆12 Jul 5, 2024 Updated last year
- PyTorch implementation of A Neural Algorithm of Artistic Style ☆10 Dec 20, 2019 Updated 6 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use… ☆10 Jan 25, 2021 Updated 5 years ago
- ☆54 May 4, 2018 Updated 7 years ago
- Lightweight speaker anonymization [IEEE SLT2021] ☆27 Jun 6, 2022 Updated 3 years ago
- ☆110 Dec 14, 2016 Updated 9 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆18 Oct 2, 2024 Updated last year
- Voice Conversion using Tacotron. ☆11 Dec 29, 2022 Updated 3 years ago
- Codebase and project page for EDMSound ☆35 Nov 20, 2023 Updated 2 years ago
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators" ☆14 Sep 25, 2023 Updated 2 years ago
- Prosody-semantics Interface in Seoul Korean ☆12 Oct 9, 2020 Updated 5 years ago
- Symbolic Music Genre Transfer with CycleGAN ☆278 Apr 4, 2021 Updated 4 years ago
- Self-supervised VQ-VAE for One-Shot Music Style Transfer ☆99 Feb 24, 2025 Updated last year
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP. ☆17 Aug 8, 2021 Updated 4 years ago
- A real time implementation of the ddsp from google magenta. ☆15 Nov 8, 2021 Updated 4 years ago
- This is the implementation of the Speaker Odyssey 2020 paper "Transforming spectrum and prosody for emotional voice conversion with non-… ☆125 Dec 14, 2020 Updated 5 years ago
- PyTorch based speaker embedding model ☆16 Apr 13, 2024 Updated last year
- A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features ☆22 Dec 10, 2025 Updated 2 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 Dec 10, 2020 Updated 5 years ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source… ☆45 Jan 29, 2026 Updated last month
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech" ☆68 Dec 13, 2021 Updated 4 years ago
- Pytorch Code for S2IGAN ☆40 Aug 11, 2020 Updated 5 years ago