Conditioned U-Net for Music Source Separation
☆20May 15, 2021Updated 5 years ago
Alternatives and similar repositories for conditioned-u-net
Users that are interested in conditioned-u-net are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆27Feb 11, 2026Updated 3 months ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆17May 14, 2022Updated 4 years ago
- A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base…☆80Jul 1, 2022Updated 3 years ago
- Implementations for master thesis "Musical Instrument Recognition in Multi-Instrument Audio Contexts" with MedleyDB.☆16Apr 4, 2019Updated 7 years ago
- Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)☆18Feb 25, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆14Nov 22, 2022Updated 3 years ago
- Voice Framework☆18Jan 21, 2026Updated 4 months ago
- ☆12Oct 14, 2020Updated 5 years ago
- ☆26Mar 5, 2018Updated 8 years ago
- A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation☆137Jul 25, 2024Updated last year
- ☆19Aug 23, 2024Updated last year
- Visually-informed Music Source Separation project at Jeju 2018 Deep Learning Summer Camp☆30Sep 14, 2018Updated 7 years ago
- This is an implementation of the audio source separation model as well as the evaluation metrics proposed in the paper "Weakly Informed A…☆12Nov 26, 2019Updated 6 years ago
- ☆12Oct 9, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆19Jun 5, 2023Updated 2 years ago
- Block-Online Multi-Channel Speech Enhancement Using DNN-Supported Relative Transfer Function Estimates☆34May 26, 2020Updated 6 years ago
- Event Relation in Text-to-Audio (TTA) Generation☆21Feb 26, 2025Updated last year
- ☆37Jun 9, 2025Updated 11 months ago
- Code for the ISMIR 2021 tutorial "Programming MIR Baselines from Scratch: Three Cases Studies"☆30Nov 21, 2021Updated 4 years ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 11 months ago
- Pytorch implementation of Deepmind's WaveRNN model☆13Apr 5, 2020Updated 6 years ago
- ☆13Nov 26, 2019Updated 6 years ago
- Official repository of Wavehax vocoder☆72Dec 20, 2025Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A simple and humble image captioning application, based on a neural network built with Keras☆10Sep 23, 2022Updated 3 years ago
- ISMIR2018 Tutorial on Open Source and Reproducibility in MIR Research☆41Aug 16, 2022Updated 3 years ago
- A Tensorflow LSTM spam detector utilizing GloVe word embeddings.☆12Nov 9, 2019Updated 6 years ago
- ☆55Jul 16, 2025Updated 10 months ago
- Supplementary code for the experiments described in the 2021 ISMIR submission: Leveraging Hierarchical Structures for Few Shot Musical In…☆41Aug 12, 2022Updated 3 years ago
- This repository contains the training code from paper "SpidR Learning Fast and Stable Linguistic Units for Spoken Language Models Without…☆57May 18, 2026Updated last week
- Self-supervised VQ-VAE for One-Shot Music Style Transfer☆99Feb 24, 2025Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆17Feb 16, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords☆19Nov 30, 2024Updated last year
- ☆17May 28, 2018Updated 8 years ago
- ☆15Sep 26, 2022Updated 3 years ago
- ☆16Oct 31, 2022Updated 3 years ago
- Training and evaluation code for Re-MOVE models with embedding distillation☆31Jul 6, 2023Updated 2 years ago
- The code about “LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhance…☆45Oct 10, 2025Updated 7 months ago
- Implementation of Emo-StarGAN☆46Dec 19, 2023Updated 2 years ago