[NeurIPS 2025] Separate Anything in Audio with Zero Training
★59 · Nov 3, 2025 · Updated 6 months ago
Alternatives and similar repositories for ZeroSep
Users who are interested in ZeroSep are comparing it to the repositories listed below.
- [IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official PyTorch implementation of the paper "High-Quality Visually-Guided Sound … ★31 · Mar 30, 2026 · Updated last month
- ★17 · Oct 2, 2023 · Updated 2 years ago
- ★120 · May 5, 2026 · Updated 2 weeks ago
- ★27 · Jan 26, 2026 · Updated 3 months ago
- ★20 · Jun 12, 2025 · Updated 11 months ago
- This is the official repository of "Scalable Neural Vocoder from Range-Null Space Decomposition", which is submitted to TPAMI. ★54 · Oct 11, 2025 · Updated 7 months ago
- ★74 · Jan 25, 2025 · Updated last year
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems". ★22 · Jun 10, 2024 · Updated last year
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models ★66 · Oct 18, 2024 · Updated last year
- Sound Separation, Omni modal ★28 · Sep 15, 2025 · Updated 8 months ago
- ★12 · Mar 11, 2025 · Updated last year
- ★41 · May 12, 2026 · Updated last week
- Sound field reconstruction using neural processes with dynamic kernels ★16 · Mar 25, 2025 · Updated last year
- Music repair method to convert lossy MP3 compressed music to lossless music. ★374 · Aug 12, 2025 · Updated 9 months ago
- Chorale Music Separation Dataset and Model Framework ★40 · Dec 5, 2022 · Updated 3 years ago
- A standardized toolkit of Kernel Audio Distance (KAD), a distribution-free, unbiased, and computationally efficient metric for evaluating … ★103 · Jun 12, 2025 · Updated 11 months ago
- Official Repository for paper "Ambisonizer: Neural Upmixing as Spherical Harmonics Generation" ★16 · May 27, 2024 · Updated last year
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support" ★59 · Jul 29, 2025 · Updated 9 months ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline. ★17 · May 9, 2025 · Updated last year
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations ★99 · Feb 20, 2026 · Updated 3 months ago
- Official implementation of WildFX Dataset Generating pipeline. ★18 · Oct 21, 2025 · Updated 7 months ago
- Official Repository for "Music Source Restoration" ★32 · Jun 1, 2025 · Updated 11 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation… ★77 · Jul 29, 2024 · Updated last year
- Zero-Shot Blind Audio Bandwidth Extension ★27 · May 25, 2023 · Updated 2 years ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings ★57 · Jan 16, 2026 · Updated 4 months ago
- ★10 · Sep 25, 2024 · Updated last year
- ★43 · Feb 21, 2023 · Updated 3 years ago
- Query-conditioned target sound extraction model ★30 · Mar 25, 2025 · Updated last year
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers ★120 · May 19, 2025 · Updated last year
- Audio Entailment: Deductive Reasoning for Audio Understanding ★17 · Dec 10, 2024 · Updated last year
- ★179 · Oct 24, 2023 · Updated 2 years ago
- The source code for the paper XiaoiceSing2 (Interspeech 2023) ★49 · Jan 15, 2024 · Updated 2 years ago
- Causality Check in Frame-online Speech Separation ★49 · Dec 11, 2022 · Updated 3 years ago
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet) ★166 · Aug 5, 2022 · Updated 3 years ago
- ★36 · Jun 16, 2023 · Updated 2 years ago
- A neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kbps or 4.5 kbps. ★206 · Apr 30, 2026 · Updated 3 weeks ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S… ★14 · Feb 15, 2023 · Updated 3 years ago
- [INTERSPEECH 2025 Oral] Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment" ★66 · Jun 16, 2025 · Updated 11 months ago
- Crawled from FreeMidi.org, MIDI files library including over twenty thousand files! ★32 · Jun 6, 2020 · Updated 5 years ago