Veleslavia / conditioned-u-net
Conditioned U-Net for Music Source Separation
☆20Updated 3 years ago
Alternatives and similar repositories for conditioned-u-net:
Users that are interested in conditioned-u-net are comparing it to the libraries listed below
- Deep Performer: Score-to-audio music performance synthesis☆42Updated last year
- ☆79Updated last year
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆42Updated 2 years ago
- ☆24Updated 2 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆53Updated 2 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆72Updated 3 years ago
- ☆39Updated 4 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆47Updated last year
- ICASSP 2022☆61Updated 3 years ago
- An evaluation toolkit for voice conversion models.☆40Updated 3 years ago
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS…☆85Updated 2 years ago
- An unofficial implementation of Vector Quantization Voice Conversion (VQVC).☆29Updated 3 years ago
- This repository contains laughter-related synthesis systems.☆12Updated 4 years ago
- PyTorch Dataset for Speech and Music audio☆73Updated 6 months ago
- ☆23Updated 5 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆62Updated last year
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"☆54Updated last month
- Deep Speech Distances PyTorch☆27Updated 2 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 3 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Updated last year
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated last year
- Simple baseline model for the HEAR benchmark☆23Updated last week
- Rough implementation of Simultaneous Separation and Transcription of Mixtures with Multiple Polyphonic and Percussive Instruments (Ethan …☆23Updated 4 years ago
- Control mechanisms to the U-Net architecture for doing multiple source separation instruments☆49Updated 4 years ago
- Chorale Music Separation Dataset and Model Framework☆33Updated 2 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆112Updated last year
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆77Updated 3 years ago