yongyizang/TrainingFreeMultiStepASR

Official Repository for "Training-Free Multi-Step Audio Source Separation"

☆54

Alternatives and similar repositories for TrainingFreeMultiStepASR

Users who are interested in TrainingFreeMultiStepASR are comparing it to the repositories listed below.

smulelabs / windowed-roformer
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45 · Oct 30, 2025 · Updated 8 months ago
yongyizang / music-source-restoration
Official Repository for "Music Source Restoration"
☆31 · Jun 1, 2025 · Updated last year
kwatcharasupat / source-separation-landing
Landing Page for All Things Source Separation
☆38 · Sep 12, 2025 · Updated 10 months ago
WingSingFung / TISDiSS
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16 · Oct 19, 2025 · Updated 9 months ago
yukara-ikemiya / floss-torch
PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind
☆96 · Nov 24, 2025 · Updated 7 months ago
kwatcharasupat / musdb25
MUSDB25 - A Fully Multitrack Dataset for Music Source Separation
☆13 · Mar 29, 2025 · Updated last year
smulelabs / smule-renaissance
Official Repository of Smule Renaissance, Smule's Vocal Restoration Models
☆42 · Oct 27, 2025 · Updated 8 months ago
JusperLee / Swift-Net
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26 · Updated this week
kwatcharasupat / divide-and-remaster-v3
Landing Page for Divide and Remaster v3
☆26 · Jul 29, 2025 · Updated 11 months ago
amanteur / SCNet-PyTorch
Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"
☆63 · Apr 14, 2024 · Updated 2 years ago
JusperLee / TFACM
☆23 · Jul 16, 2025 · Updated last year
iver56 / loudness
The world's fastest Python package for calculating integrated loudness (LUFS) from audio data as NumPy arrays
☆31 · Dec 26, 2025 · Updated 6 months ago
Andong-Li-speech / Neural-Vocoders-as-Speech-Enhancers
☆52 · Sep 10, 2024 · Updated last year
crlandsc / torch-l1-snr
Variations of L1 SNR Loss function for training audio source separation machine learning models
☆45 · May 1, 2026 · Updated 2 months ago
Audio-AGI / FlowSep
Official implementation for FlowSep
☆77 · Jan 2, 2025 · Updated last year
WangHelin1997 / LibriLightMix-WHAMR
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17 · Nov 7, 2024 · Updated last year
lucidrains / HS-TasNet
Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet"
☆109 · Apr 23, 2026 · Updated 2 months ago
yongyizang / GSound-SIR
A Python Room Spatial Impulse Response Ray-Tracing Toolkit
☆86 · Mar 4, 2026 · Updated 4 months ago
junyuchen-cjy / DTTNet-Pytorch
An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation
☆109 · Mar 19, 2024 · Updated 2 years ago
nanless / universal-speech-enhancement
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…
☆82 · Jul 29, 2024 · Updated last year
JusperLee / Gull-Codec-Training
☆12 · Mar 11, 2025 · Updated last year
Audio-WestlakeU / CleanMel
Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".
☆94 · Feb 2, 2026 · Updated 5 months ago
Yip-Jia-Qi / codecformer
☆21 · Jul 15, 2024 · Updated 2 years ago
lucidrains / BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
☆873 · Jun 14, 2026 · Updated last month
SonyCSLParis / Stem-JEPA
Joint Embedding Predictive Architecture for Musical Stem Compatibility Estimation
☆55 · Aug 6, 2024 · Updated last year
taishi-n / torchrir
PyTorch-based room impulse response (RIR) simulation toolkit with dynamic scenes, GPU acceleration.
☆22 · Feb 18, 2026 · Updated 5 months ago
haoheliu / Subband-Music-Separation
Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation
☆102 · Nov 12, 2021 · Updated 4 years ago
merlresearch / tf-locoformer
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
☆133 · Aug 8, 2025 · Updated 11 months ago
JusperLee / SPMamba
☆227 · Dec 5, 2024 · Updated last year
kimsunwiub / BLOOM-Net
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
☆14 · Feb 13, 2022 · Updated 4 years ago
WikiChao / ZeroSep
[NeurIPS 2025] Separate Anything in Audio with Zero Training
☆60 · Nov 3, 2025 · Updated 8 months ago
Audio-WestlakeU / Mel-McNet
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
☆26 · May 14, 2026 · Updated 2 months ago
crlandsc / torch-log-wmse
logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…
☆48 · Apr 29, 2026 · Updated 2 months ago
nomonosound / log-wmse-audio-quality
logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…
☆39 · Jun 24, 2025 · Updated last year
seongq / flowmse
(ICASSP 2025, official code) FlowSE: Flow Matching-based Speech Enhancement
☆107 · Jul 23, 2025 · Updated 11 months ago
jeonchangbin49 / LimitAug
☆23 · Aug 30, 2022 · Updated 3 years ago
wavlab-speech / versa
Versatile Evaluation of Speech and Audio
☆423 · Updated this week
Sreyan88 / ReCLAP
☆33 · Dec 23, 2025 · Updated 6 months ago
sp-uhh / storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
☆255 · Sep 13, 2024 · Updated last year