WikiChao/ZeroSep

[NeurIPS 2025] Separate Anything in Audio with Zero Training

☆60

Alternatives and similar repositories for ZeroSep

Users that are interested in ZeroSep are comparing it to the libraries listed below.

WikiChao / DAVIS
[🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …
☆33 · Mar 30, 2026 · Updated 3 months ago
WikiChao / VisAH
[CVPR 2025] Pytorch implementation of the paper "Learning to Highlight Audio by Watching Movies"
☆15 · Oct 1, 2025 · Updated 9 months ago
smulelabs / windowed-roformer
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45 · Oct 30, 2025 · Updated 8 months ago
qiuqiangkong / audioflow
☆128 · Updated this week
JusperLee / Gull-Codec-Training
☆12 · Mar 11, 2025 · Updated last year
yongyizang / music-source-restoration
Official Repository for "Music Source Restoration"
☆31 · Jun 1, 2025 · Updated last year
yongyizang / TrainingFreeMultiStepASR
Official Repository for "Training-Free Multi-Step Audio Source Separation"
☆54 · May 26, 2025 · Updated last year
Audio-AGI / FlowSep
Official implementation for FlowSep
☆77 · Jan 2, 2025 · Updated last year
zengchang233 / CrossSinger
The source code for the paper CrossSinger (asru2023)
☆18 · Oct 12, 2023 · Updated 2 years ago
rxtan2 / AVSeT
☆17 · Oct 2, 2023 · Updated 2 years ago
yunlong10 / Video-R4
Reinforcing Text-Rich Video Reasoning with Visual Rumination
☆28 · Jun 5, 2026 · Updated last month
kaistmm / V2SFlow
[ICASSP 2025] V2SFlow: Video-to-Speech Generation with Speech Decomposition and Rectified Flow
☆21 · Jun 3, 2025 · Updated last year
qiuqiangkong / mini_llm
☆29 · Jul 4, 2025 · Updated last year
MRSAudio / MRSAudio_Main
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
☆43 · Oct 15, 2025 · Updated 9 months ago
yangdongchao / ALMTokenizer2
The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…
☆45 · Sep 5, 2025 · Updated 10 months ago
AMAAI-Lab / SonicVerse
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
☆53 · Jul 28, 2025 · Updated 11 months ago
Aisaka0v0 / CLAPSep
Query-conditioned target sound extraction model
☆30 · Mar 25, 2025 · Updated last year
violet-liang / soundfield-reconstruction-np
Sound field reconstruction using neural processes with dynamic kernels
☆16 · Mar 25, 2025 · Updated last year
JusperLee / SPMamba
☆227 · Dec 5, 2024 · Updated last year
qiuqiangkong / materials_for_students
☆16 · Aug 10, 2025 · Updated 11 months ago
zengchang233 / xiaoicesing2
The source code for the paper XiaoiceSing2 (interspeech2023)
☆49 · Jan 15, 2024 · Updated 2 years ago
starrytong / SCNet
☆157 · Sep 8, 2025 · Updated 10 months ago
EleutherAI / aria-amt
Efficient and robust implementation of seq-to-seq automatic piano transcription.
☆70 · Dec 16, 2025 · Updated 7 months ago
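Wangtk311 / SafeEar-Inference-Test-Script
SafeEar is a deepfake audio detection model jointly developed by Zhejiang University and Tsinghua University. This is a model inference script that I wrote. I am not sure whether it is correct; I am still a beginner, so please forgive any problems and point them out. Thank you!
☆16 · May 16, 2025 · Updated last year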
slSeanWU / beats-conformer-bart-audio-captioner
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41 · Jan 6, 2024 · Updated 2 years ago
ETH-DISCO / discoder
Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025
☆42 · Feb 24, 2025 · Updated last year
YoonjinXD / kadtk
A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …
☆104 · Jun 12, 2025 · Updated last year
yoongi43 / MGE-LDM
Official implementation of the paper MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction
☆20 · Feb 19, 2026 · Updated 5 months ago
qiuqiangkong / music_llm
☆56 · Jul 13, 2025 · Updated last year
smulelabs / smule-renaissance
Official Repository of Smule Renaissance, Smule's Vocal Restoration Models
☆43 · Oct 27, 2025 · Updated 8 months ago
RetroCirce / Zero_Shot_Audio_Source_Separation
The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022
☆212 · Jul 14, 2022 · Updated 4 years ago
Andong-Li-speech / BridgeVoC
This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".
☆67 · Nov 5, 2025 · Updated 8 months ago
jhtonyKoo / music_mixing_style_transfer
☆181 · Oct 24, 2023 · Updated 2 years ago
Andong-Li-speech / TaEr
This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…
☆14 · Nov 25, 2022 · Updated 3 years ago
RetroCirce / Choral_Music_Separation
Chorale Music Separation Dataset and Model Framework
☆41 · Dec 5, 2022 · Updated 3 years ago
xiquan-li / FineLAP
[ACL 2026 Main] FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pre-training
☆36 · Apr 20, 2026 · Updated 3 months ago
iamycy / golf
A DDSP-based neural voice synthesiser.
☆135 · Nov 14, 2024 · Updated last year
nttcslab / msm-mae
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
☆99 · Feb 20, 2026 · Updated 5 months ago
yzyouzhang / Audio_Research_in_US
Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…
☆27 · Feb 27, 2026 · Updated 4 months ago