spkgyk/RTFS-Net

Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted at ICLR 2024

☆51

Alternatives and similar repositories for RTFS-Net

Users who are interested in RTFS-Net are comparing it to the repositories listed below.

JusperLee / Swift-Net
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26 · Updated this week
JusperLee / CTCNet
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆82 · Apr 28, 2024 · Updated 2 years ago
lin9x / AV-Sepformer
☆65 · Jun 28, 2023 · Updated 3 years ago
JusperLee / TFACM
☆23 · Jul 16, 2025 · Updated last year
JusperLee / SPMamba
☆227 · Dec 5, 2024 · Updated last year
TaoRuijie / SEANet
Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)
☆32 · Feb 28, 2025 · Updated last year
hmartelb / avlit
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆20 · Sep 1, 2023 · Updated 2 years ago
dr-pato / SSGD
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆15 · Dec 22, 2022 · Updated 3 years ago
zexupan / MuSE
☆42 · Nov 22, 2024 · Updated last year
JusperLee / IIANet
This is the demo of our paper "IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation".
☆110 · Mar 12, 2025 · Updated last year
smulelabs / windowed-roformer
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45 · Oct 30, 2025 · Updated 8 months ago
JusperLee / AV-ConvTasNet
Unofficial Time Domain Audio Visual Speech Separation Implementation
☆45 · Apr 19, 2023 · Updated 3 years ago
facebookresearch / VisualVoice
Audio-Visual Speech Separation with Cross-Modal Consistency
☆250 · Jul 25, 2023 · Updated 3 years ago
JusperLee / Look2hear
A toolkit for researchers in the multimodal sound separation.
☆16 · Oct 20, 2023 · Updated 2 years ago
SiavashShams / ssamba
[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
☆140 · Nov 5, 2025 · Updated 8 months ago
JuanFMontesinos / VoViT
VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer
☆35 · Mar 18, 2023 · Updated 3 years ago
JusperLee / TDANet
An efficient speech separation method
☆277 · Apr 11, 2024 · Updated 2 years ago
WingSingFung / TISDiSS
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16 · Oct 19, 2025 · Updated 9 months ago
kwatcharasupat / divide-and-remaster-v3
Landing Page for Divide and Remaster v3
☆26 · Jul 29, 2025 · Updated 11 months ago
YUCHEN005 / Unified-Enhance-Separation
Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"
☆45 · Jul 10, 2024 · Updated 2 years ago
SarthakYadav / axlstm-official
Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"
☆21 · Sep 7, 2025 · Updated 10 months ago
XiaoyuBIE1994 / SDCodec
(ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec
☆48 · May 16, 2025 · Updated last year
sc0ttms / SE-DCCRN
☆22 · Mar 2, 2022 · Updated 4 years ago
haoxiangsnr / llm-tse
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)
☆43 · Oct 13, 2023 · Updated 2 years ago
xi-j / Mamba-TasNet
☆116 · Oct 1, 2024 · Updated last year
fakufaku / diffusion-separation
Single channel speech source separation by diffusion process (ICASSP 2023)
☆126 · Mar 15, 2024 · Updated 2 years ago
ahmadikalkhorani / AVCrossNet
☆16 · Jul 4, 2024 · Updated 2 years ago
JusperLee / Dolphin
☆185 · Updated this week
Dream-High / DJCM
☆30 · Apr 22, 2024 · Updated 2 years ago
yongyizang / TrainingFreeMultiStepASR
Official Repository for "Training-Free Multi-Step Audio Source Separation"
☆54 · May 26, 2025 · Updated last year
donghoney0416 / DeFTAN-II
Official page of "DeFTAN-II: Efficient multichannel speech enhancement with subgroup processing", IEEE/ACM Transactions on Audio, Speech,…
☆34 · Nov 21, 2024 · Updated last year
Sreyan88 / RECAP
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16 · Jun 23, 2024 · Updated 2 years ago
jyhan03 / icassp22-dataset
Dataset simulation for DPCCN.
☆16 · Dec 25, 2022 · Updated 3 years ago
Beilong-Tang / TSELM
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60 · Apr 14, 2025 · Updated last year
Sreyan88 / LAPE
A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)
☆29 · Jul 9, 2024 · Updated 2 years ago
Sreyan88 / ReCLAP
☆33 · Dec 23, 2025 · Updated 7 months ago
JusperLee / Dual-Path-RNN-Pytorch
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation, implemented in PyTorch
☆468 · Feb 14, 2023 · Updated 3 years ago
xi-j / Mamba-ASR
ConMamba for Automatic Speech Recognition
☆106 · Aug 12, 2024 · Updated last year
jyhan03 / dpccn
This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.
☆13 · Dec 8, 2021 · Updated 4 years ago