[NeurIPS 2025] Separate Anything in Audio with Zero Training
☆56Nov 3, 2025Updated 4 months ago
Alternatives and similar repositories for ZeroSep
Users who are interested in ZeroSep are comparing it to the repositories listed below
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …☆28Nov 1, 2025Updated 4 months ago
- ☆117Updated this week
- This is the official repository of "Scalable Neural Vocoder from Range-Null Space Decomposition", which is submitted to TPAMI.☆35Oct 11, 2025Updated 4 months ago
- ☆17Oct 2, 2023Updated 2 years ago
- Chorale Music Separation Dataset and Model Framework☆40Dec 5, 2022Updated 3 years ago
- ☆16Jun 12, 2025Updated 8 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆52Jul 29, 2025Updated 7 months ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 9 months ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings☆55Jan 16, 2026Updated last month
- ☆10Sep 25, 2024Updated last year
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- Music repair method to convert lossy MP3 compressed music to lossless music.☆358Aug 12, 2025Updated 6 months ago
- Official Repository for "Music Source Restoration"☆32Jun 1, 2025Updated 9 months ago
- The source code for the paper XiaoiceSing2 (Interspeech 2023)☆49Jan 15, 2024Updated 2 years ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆95Jun 12, 2025Updated 8 months ago
- Official Repository for paper "Ambisonizer: Neural Upmixing as Spherical Harmonics Generation"☆15May 27, 2024Updated last year
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- Official implementation of the WildFX dataset generation pipeline.☆15Oct 21, 2025Updated 4 months ago
- Sound field reconstruction using neural processes with dynamic kernels☆15Mar 25, 2025Updated 11 months ago
- ☆25Jan 26, 2026Updated last month
- CCMusic, an open Chinese music database, integrates diverse datasets. It ensures data consistency via cleaning, label refinement and stru…☆26Oct 31, 2025Updated 4 months ago
- Crawled from FreeMidi.org, MIDI files library including over twenty thousand files!☆32Jun 6, 2020Updated 5 years ago
- Comprehensive benchmark suite comparing pitch detection algorithms across multiple datasets.☆58Sep 1, 2025Updated 6 months ago
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models☆63Oct 18, 2024Updated last year
- Causality Check in Frame-online Speech Separation☆50Dec 11, 2022Updated 3 years ago
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆22Jun 10, 2024Updated last year
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.☆15Mar 22, 2023Updated 2 years ago
- ☆13Mar 11, 2025Updated 11 months ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆38Feb 24, 2025Updated last year
- ☆16Jan 11, 2026Updated last month
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆32Mar 4, 2025Updated 11 months ago
- ☆70Jan 25, 2025Updated last year
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆45Sep 5, 2025Updated 5 months ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Nov 25, 2022Updated 3 years ago
- ☆15Apr 2, 2025Updated 11 months ago
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆163Aug 5, 2022Updated 3 years ago
- [INTERSPEECH 2025 Oral] Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 8 months ago
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆54May 26, 2025Updated 9 months ago