smulelabs/windowed-roformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/smulelabs/windowed-roformer)

smulelabs / windowed-roformer

Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"

☆45

Alternatives and similar repositories for windowed-roformer

Users that are interested in windowed-roformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

smulelabs / smule-renaissance
View on GitHub
Official Repository of Smule Renaissance, Smule's Vocal Restoration Models
☆43Oct 27, 2025Updated 9 months ago
yongyizang / TrainingFreeMultiStepASR
View on GitHub
Official Repository for "Training-Free Multi-Step Audio Source Separation"
☆54May 26, 2025Updated last year
yongyizang / music-source-restoration
View on GitHub
Official Repository for "Music Source Restoration"
☆31Jun 1, 2025Updated last year
WingSingFung / TISDiSS
View on GitHub
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16Oct 19, 2025Updated 9 months ago
junyuchen-cjy / DTTNet-Pytorch
View on GitHub
An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation
☆109Mar 19, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
crlandsc / torch-l1-snr
View on GitHub
Variations of L1 SNR Loss function for training audio source separation machine learning models
☆45May 1, 2026Updated 2 months ago
yukara-ikemiya / floss-torch
View on GitHub
PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind
☆97Nov 24, 2025Updated 8 months ago
lucidrains / HS-TasNet
View on GitHub
Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet"
☆109Apr 23, 2026Updated 3 months ago
JusperLee / Swift-Net
View on GitHub
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26Jul 20, 2026Updated last week
kwatcharasupat / divide-and-remaster-v3
View on GitHub
Landing Page for Divide and Remaster v3
☆26Jul 29, 2025Updated 11 months ago
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
itsnotacie / AAAI-26_SepPrune
View on GitHub
SepPrune: Structured Pruning for Efficient Deep Speech Separation-AAAI'26
☆15May 31, 2025Updated last year
JethroWangSir / SincQDR-VAD
View on GitHub
☆26Aug 29, 2025Updated 11 months ago
XiaoyuBIE1994 / SDCodec
View on GitHub
(ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec
☆48May 16, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Audio-WestlakeU / CleanMel
View on GitHub
Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".
☆94Feb 2, 2026Updated 5 months ago
iver56 / loudness
View on GitHub
The world's fastest Python package for calculating integrated loudness (LUFS) from audio data as NumPy arrays
☆31Dec 26, 2025Updated 7 months ago
nanless / universal-speech-enhancement
View on GitHub
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…
☆83Jul 29, 2024Updated last year
seongq / flowmse
View on GitHub
(ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement
☆108Jul 23, 2025Updated last year
WikiChao / ZeroSep
View on GitHub
[NeurIPS 2025] Separate Anything in Audio with Zero Training
☆60Nov 3, 2025Updated 8 months ago
JusperLee / TFACM
View on GitHub
☆24Jul 16, 2025Updated last year
weAreMusicAI / dmx-diffusion
View on GitHub
☆15Oct 13, 2025Updated 9 months ago
Audio-AGI / FlowSep
View on GitHub
Official implementation for FlowSep
☆77Jan 2, 2025Updated last year
yluo42 / SRVQ
View on GitHub
Spherical residual vector quantization (SRVQ)
☆31Aug 25, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
qiuqiangkong / audioflow
View on GitHub
☆130Updated this week
JusperLee / Look2hear
View on GitHub
A toolkit for researchers in the multimodal sound separation.
☆16Oct 20, 2023Updated 2 years ago
aleXiehta / AD-FlowTSE
View on GitHub
Adaptive Flow-Matching for Target Speaker Extraction
☆39Jul 13, 2026Updated 2 weeks ago
violet-liang / soundfield-reconstruction-np
View on GitHub
Sound field reconstruction using neural processes with dynamic kernels
☆16Mar 25, 2025Updated last year
yxlu-0102 / AP-BWE
View on GitHub
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
☆194Apr 15, 2025Updated last year
juhayna-zh / BSRNN-speech-preprocess
View on GitHub
A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.
☆15Aug 22, 2023Updated 2 years ago
kwatcharasupat / bandit-v2
View on GitHub
Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"
☆65Jul 29, 2025Updated 11 months ago
starrytong / SCNet
View on GitHub
☆157Sep 8, 2025Updated 10 months ago
JusperLee / SPMamba
View on GitHub
☆227Dec 5, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
koudounasalkis / voc2vec
View on GitHub
This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.
☆58Apr 14, 2025Updated last year
LAION-AI / scaled-echo-tts
View on GitHub
Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024…
☆24Mar 29, 2026Updated 3 months ago
meaningTeam / tidy-tunes
View on GitHub
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23May 19, 2026Updated 2 months ago
WangHelin1997 / SoloAudio
View on GitHub
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.
☆121Jan 28, 2026Updated 6 months ago
ozspeech / OZSpeech
View on GitHub
[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching
☆45Feb 9, 2025Updated last year
YangXusheng-yxs / CodecFormer_5Hz
View on GitHub
☆35Oct 23, 2025Updated 9 months ago
AMAAI-Lab / SonicMaster
View on GitHub
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
☆190Jun 5, 2026Updated last month