Conditioned U-Net for Music Source Separation
☆20May 15, 2021Updated 4 years ago
Alternatives and similar repositories for conditioned-u-net
Users that are interested in conditioned-u-net are comparing it to the libraries listed below.
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆24Feb 11, 2026Updated last month
- Official implementation of A cappella: Audio-visual Singing Voice Separation, from BMVC21☆17May 14, 2022Updated 3 years ago
- A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base…☆80Jul 1, 2022Updated 3 years ago
- Implementations for master thesis "Musical Instrument Recognition in Multi-Instrument Audio Contexts" with MedleyDB.☆16Apr 4, 2019Updated 6 years ago
- Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)☆17Feb 25, 2026Updated last month
- ☆14Nov 22, 2022Updated 3 years ago
- Voice Framework☆18Jan 21, 2026Updated 2 months ago
- ☆12Oct 14, 2020Updated 5 years ago
- ☆26Mar 5, 2018Updated 8 years ago
- A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation"☆138Jul 25, 2024Updated last year
- ☆18Aug 23, 2024Updated last year
- Visually-informed Music Source Separation project at Jeju 2018 Deep Learning Summer Camp☆30Sep 14, 2018Updated 7 years ago
- ☆10Oct 9, 2025Updated 5 months ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆19Jun 5, 2023Updated 2 years ago
- Block-Online Multi-Channel Speech Enhancement Using DNN-Supported Relative Transfer Function Estimates☆34May 26, 2020Updated 5 years ago
- Event Relation in Text-to-Audio (TTA) Generation☆20Feb 26, 2025Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆53Mar 20, 2026Updated last week
- ☆35Jun 9, 2025Updated 9 months ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 9 months ago
- Code for the ISMIR 2021 tutorial "Programming MIR Baselines from Scratch: Three Case Studies"☆30Nov 21, 2021Updated 4 years ago
- PyTorch implementation of DeepMind's WaveRNN model☆13Apr 5, 2020Updated 5 years ago
- Official repository of Wavehax vocoder☆66Dec 20, 2025Updated 3 months ago
- ☆13Nov 26, 2019Updated 6 years ago
- A simple and humble image captioning application, based on a neural network built with Keras☆10Sep 23, 2022Updated 3 years ago
- ISMIR2018 Tutorial on Open Source and Reproducibility in MIR Research☆42Aug 16, 2022Updated 3 years ago
- A Tensorflow LSTM spam detector utilizing GloVe word embeddings.☆12Nov 9, 2019Updated 6 years ago
- ☆55Jul 16, 2025Updated 8 months ago
- Supplementary code for the experiments described in the 2021 ISMIR submission: Leveraging Hierarchical Structures for Few Shot Musical In…☆41Aug 12, 2022Updated 3 years ago
- This repository contains the training code from paper "SpidR Learning Fast and Stable Linguistic Units for Spoken Language Models Without…☆54Mar 17, 2026Updated last week
- Self-supervised VQ-VAE for One-Shot Music Style Transfer☆99Feb 24, 2025Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆17Feb 16, 2023Updated 3 years ago
- [NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords☆18Nov 30, 2024Updated last year
- ☆17May 28, 2018Updated 7 years ago
- ☆15Sep 26, 2022Updated 3 years ago
- Training and evaluation code for Re-MOVE models with embedding distillation☆31Jul 6, 2023Updated 2 years ago
- ☆16Oct 31, 2022Updated 3 years ago
- The code about “LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhance…☆40Oct 10, 2025Updated 5 months ago
- Implementation of Emo-StarGAN☆46Dec 19, 2023Updated 2 years ago