JusperLee/Swift-Net

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JusperLee/Swift-Net)

JusperLee / Swift-Net

Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation

☆26

Alternatives and similar repositories for Swift-Net

Users that are interested in Swift-Net are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JusperLee / TFACM
View on GitHub
☆23Jul 16, 2025Updated last year
spkgyk / RTFS-Net
View on GitHub
Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024
☆51Oct 14, 2025Updated 9 months ago
IsraelCohenLab / ConstantBeamwidthUCCA
View on GitHub
☆11Jun 6, 2022Updated 4 years ago
gdalsanto / aec-evaluation
View on GitHub
modules for the evaluation of acoustic echo cancellation systems
☆20Nov 2, 2021Updated 4 years ago
JusperLee / speech-paper-daily-skill
View on GitHub
☆26Mar 31, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Clovermax / AED-TSVAD
View on GitHub
Attention-Based Encoder-Decoder Target-Speaker Voice Activity Detection for Robust Speaker Diarization
☆31Sep 22, 2025Updated 10 months ago
ZBang / USEF-TSE
View on GitHub
☆70Jul 5, 2025Updated last year
narrietal / Fast-ULCNet
View on GitHub
Official repository of Fast-ULCNet.
☆39Jun 17, 2026Updated last month
Yip-Jia-Qi / codecformer
View on GitHub
☆21Jul 15, 2024Updated 2 years ago
JusperLee / Look2hear
View on GitHub
A toolkit for researchers in the multimodal sound separation.
☆16Oct 20, 2023Updated 2 years ago
smulelabs / windowed-roformer
View on GitHub
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45Oct 30, 2025Updated 8 months ago
itsnotacie / AAAI-26_SepPrune
View on GitHub
SepPrune: Structured Pruning for Efficient Deep Speech Separation-AAAI'26
☆15May 31, 2025Updated last year
audiolabs / MonteCarloRIRSimulation
View on GitHub
Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)
☆18Feb 25, 2026Updated 4 months ago
WingSingFung / TISDiSS
View on GitHub
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16Oct 19, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Tokisywyy / AECNS-CAGCRN
View on GitHub
☆18Mar 9, 2025Updated last year
tencent-ailab / FRA-RIR
View on GitHub
☆214Dec 4, 2023Updated 2 years ago
Audio-WestlakeU / Mel-McNet
View on GitHub
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
☆26May 14, 2026Updated 2 months ago
haoxiangsnr / llm-tse
View on GitHub
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)
☆43Oct 13, 2023Updated 2 years ago
merlresearch / tf-locoformer
View on GitHub
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
☆133Aug 8, 2025Updated 11 months ago
yhsong06 / LAU-Net
View on GitHub
☆16May 23, 2025Updated last year
IsraelCohenLab / ConstantBeamwidthBeamformingNonuniform
View on GitHub
☆15May 9, 2022Updated 4 years ago
JusperLee / S4M
View on GitHub
Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models
☆28Feb 25, 2026Updated 4 months ago
ssi-research / FQSS
View on GitHub
Fully quantized Neural Networks for Audio Source Separation
☆17Aug 11, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
sp-uhh / ears_benchmark
View on GitHub
Generation scripts for EARS-WHAM and EARS-Reverb
☆48Jul 4, 2025Updated last year
MyParadise21 / Mamba-SEUNet
View on GitHub
This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement
☆94May 26, 2025Updated last year
Beilong-Tang / TSELM
View on GitHub
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60Apr 14, 2025Updated last year
IiuZiKai / Evo_TSE
View on GitHub
☆17Apr 9, 2026Updated 3 months ago
aleXiehta / AD-FlowTSE
View on GitHub
Adaptive Flow-Matching for Target Speaker Extraction
☆39Jul 13, 2026Updated last week
yongyizang / TrainingFreeMultiStepASR
View on GitHub
Official Repository for "Training-Free Multi-Step Audio Source Separation"
☆54May 26, 2025Updated last year
JusperLee / SPMamba
View on GitHub
☆227Dec 5, 2024Updated last year
JusperLee / CTCNet
View on GitHub
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆82Apr 28, 2024Updated 2 years ago
TaoRuijie / SEANet
View on GitHub
Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)
☆32Feb 28, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
WangHelin1997 / SoloAudio
View on GitHub
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.
☆119Jan 28, 2026Updated 5 months ago
isHuangZiling / D-LGTSE
View on GitHub
☆23Updated this week
hyyan2k / LiSenNet
View on GitHub
This is the official implementation of the LiSenNet
☆161Nov 15, 2024Updated last year
Audio-WestlakeU / CleanMel
View on GitHub
Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".
☆94Feb 2, 2026Updated 5 months ago
Dahan-Wang / Rethinking-Flow-and-Diffusion-Bridge-Models-for-Speech-Enhancement
View on GitHub
☆39Feb 23, 2026Updated 4 months ago
HaoFengyuan / EEND-IAAE
View on GitHub
The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…
☆11Aug 27, 2023Updated 2 years ago
Xiaobin-Rong / deepvqe
View on GitHub
An unofficial implementation of DeepVQE proposed by Microsoft Corp.
☆147Mar 24, 2025Updated last year