alibabasglab/MossFormer

This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.

☆107

Alternatives and similar repositories for MossFormer

Users who are interested in MossFormer are comparing it to the repositories listed below.

alibabasglab / MossFormer2
This is the audio sample repository for the speech separation model "MossFormer2".
☆190 · Nov 28, 2024 · Updated last year
LiChenda / Multi-clue-TSE-data
Data simulation scripts for the paper "Target Sound Extraction with Variable Cross-modality Clues"
☆17 · May 19, 2023 · Updated 3 years ago
YUCHEN005 / Unified-Enhance-Separation
Code for the paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"
☆45 · Jul 10, 2024 · Updated 2 years ago
JusperLee / TDANet
An efficient speech separation method
☆278 · Apr 11, 2024 · Updated 2 years ago
yangdongchao / Tim-TSENet
The source code of Tim-TSENet
☆15 · Apr 22, 2022 · Updated 4 years ago
huaidanquede / Dense-TSNet
Official code for Dense-TSNet
☆12 · Sep 17, 2024 · Updated last year
xuchenglin28 / speaker_extraction_SpEx
Multi-scale time-domain speaker extraction
☆81 · Jun 7, 2021 · Updated 5 years ago
yuguochencuc / DB-AIAT
The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"
☆126 · Jun 29, 2022 · Updated 4 years ago
JusperLee / SPMamba
☆227 · Dec 5, 2024 · Updated last year
JusperLee / speech-paper-daily-skill
☆26 · Mar 31, 2026 · Updated 3 months ago
dmlguq456 / SepReformer
Official repository of SepReformer for speech separation
☆262 · May 14, 2026 · Updated 2 months ago
Honee-W / CPTNN
Unofficial implementation of "CPTNN: Cross-Parallel Transformer Neural Network for Time-Domain Speech Enhancement"
☆15 · Nov 14, 2023 · Updated 2 years ago
Audio-WestlakeU / McNet
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
☆130 · Mar 24, 2023 · Updated 3 years ago
seorim0 / DCCRN-with-various-loss-functions
DCCRN with various loss functions
☆103 · Sep 29, 2022 · Updated 3 years ago
huutuongtu / Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
☆18 · May 17, 2024 · Updated 2 years ago
ZBang / USEF-TSE
☆70 · Jul 5, 2025 · Updated last year
hmartelb / avlit
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆20 · Sep 1, 2023 · Updated 2 years ago
lin9x / AV-Sepformer
☆65 · Jun 28, 2023 · Updated 3 years ago
yoonsanghyu / FaSNet-TAC-PyTorch
Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)
☆76 · Sep 14, 2021 · Updated 4 years ago
vkothapally / Subband-Beamformer
☆33 · Nov 29, 2022 · Updated 3 years ago
Audio-WestlakeU / NBSS
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
☆363 · Jan 1, 2025 · Updated last year
lucacoma / NeuralBeamspaceDomainFilter
Unofficial implementation of "Liu, W., Li, A., Wang, X., Yuan, M., Chen, Y., Zheng, C., & Li, X. (2022). A Neural Beamspace-Domain Filter…
☆19 · Oct 21, 2022 · Updated 3 years ago
nanless / universal-speech-enhancement
Apply score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…
☆83 · Jul 29, 2024 · Updated 2 years ago
JusperLee / Swift-Net
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26 · Jul 20, 2026 · Updated last week
HaoFengyuan / X-TF-GridNet
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…
☆115 · Sep 2, 2025 · Updated 10 months ago
Andong-Li-speech / Neural-Vocoders-as-Speech-Enhancers
☆52 · Sep 10, 2024 · Updated last year
Beilong-Tang / TSELM
Official implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60 · Apr 14, 2025 · Updated last year
felixfuyihui / Uformer
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
☆117 · Jun 29, 2022 · Updated 4 years ago
yxlu-0102 / MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
☆493 · May 19, 2025 · Updated last year
zelokuo / VPIDM
This is the official repository of a new SOTA diffusion-model-based method for speech enhancement
☆43 · Jul 31, 2024 · Updated last year
gemengtju / L-SpEx
☆39 · Feb 23, 2022 · Updated 4 years ago
sp-uhh / sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
☆764 · May 12, 2026 · Updated 2 months ago
JusperLee / Dual-Path-RNN-Pytorch
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation, implemented in PyTorch
☆468 · Feb 14, 2023 · Updated 3 years ago
itsnotacie / AAAI-26_SepPrune
SepPrune: Structured Pruning for Efficient Deep Speech Separation (AAAI'26)
☆15 · May 31, 2025 · Updated last year
BUTSpeechFIT / speakerbeam
☆145 · Oct 25, 2021 · Updated 4 years ago
alibabasglab / D2Former
This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…
☆46 · Sep 6, 2023 · Updated 2 years ago
echocatzh / GTCNN
Personalized AEC
☆19 · Nov 3, 2022 · Updated 3 years ago
egruttadauria98 / SSpaVAlDo
☆37 · Jan 6, 2026 · Updated 6 months ago
chentuochao / Target-Conversation-Extraction
This is the code and dataset repo for the Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…
☆58 · Aug 15, 2025 · Updated 11 months ago