alibabasglab/MossFormer2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alibabasglab/MossFormer2)

alibabasglab / MossFormer2

This is the audio sample repository for speech separation model "MossFormer2".

☆190

Alternatives and similar repositories for MossFormer2

Users that are interested in MossFormer2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dmlguq456 / SepReformer
View on GitHub
Official repository of SepReformer for speech separation
☆262May 14, 2026Updated 2 months ago
alibabasglab / MossFormer
View on GitHub
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…
☆107Nov 28, 2024Updated last year
JusperLee / TDANet
View on GitHub
An efficient speech separation method
☆278Apr 11, 2024Updated 2 years ago
JusperLee / SPMamba
View on GitHub
☆227Dec 5, 2024Updated last year
Audio-WestlakeU / FS-EEND
View on GitHub
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆183May 7, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wenet-e2e / wesep
View on GitHub
Target Speaker Extraction Toolkit
☆300Oct 4, 2025Updated 9 months ago
IiuZiKai / Evo_TSE
View on GitHub
☆17Apr 9, 2026Updated 3 months ago
merlresearch / tf-locoformer
View on GitHub
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
☆133Aug 8, 2025Updated 11 months ago
HaoFengyuan / X-TF-GridNet
View on GitHub
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…
☆115Sep 2, 2025Updated 10 months ago
nanless / universal-speech-enhancement
View on GitHub
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…
☆83Jul 29, 2024Updated last year
alibabasglab / FRCRN
View on GitHub
☆171Nov 28, 2024Updated last year
Andong-Li-speech / Neural-Vocoders-as-Speech-Enhancers
View on GitHub
☆52Sep 10, 2024Updated last year
xi-j / Mamba-TasNet
View on GitHub
☆116Oct 1, 2024Updated last year
JusperLee / TFACM
View on GitHub
☆24Jul 16, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Honee-W / FlowSE
View on GitHub
Official repository for FlowSE (Interspeech 2025)
☆112Jul 9, 2025Updated last year
Audio-WestlakeU / NBSS
View on GitHub
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
☆363Jan 1, 2025Updated last year
ZBang / USEF-TSE
View on GitHub
☆70Jul 5, 2025Updated last year
itsnotacie / AAAI-26_SepPrune
View on GitHub
SepPrune: Structured Pruning for Efficient Deep Speech Separation-AAAI'26
☆15May 31, 2025Updated last year
Emrys365 / se-scaling
View on GitHub
Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…
☆41Aug 7, 2024Updated last year
Beilong-Tang / lauraTSE_code
View on GitHub
Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.
☆37Nov 9, 2025Updated 8 months ago
Audio-WestlakeU / RealMAN
View on GitHub
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…
☆175Apr 29, 2025Updated last year
joonaskalda / PixIT
View on GitHub
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…
☆105Jan 10, 2025Updated last year
KdaiP / conformer-RoPE
View on GitHub
Conformer block with Rotary Position Embedding, modified from lucidrains' implement
☆19Sep 13, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
fgnt / sms_wsj
View on GitHub
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
☆131Jun 7, 2024Updated 2 years ago
LAION-AI / emotional-speech-annotations
View on GitHub
This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models
☆35Oct 13, 2024Updated last year
ssi-research / FQSE
View on GitHub
Fully Quantized Neural Networks For Speech Enhancement
☆65Feb 15, 2024Updated 2 years ago
facebookresearch / ears_dataset
View on GitHub
Expressive Anechoic Recordings of Speech (EARS)
☆221Jun 25, 2024Updated 2 years ago
urgent-challenge / urgent2024_challenge
View on GitHub
Official data preparation scripts for the URGENT 2024 Challenge
☆90May 21, 2025Updated last year
WangHelin1997 / SoloAudio
View on GitHub
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.
☆121Jan 28, 2026Updated 6 months ago
JusperLee / Swift-Net
View on GitHub
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26Jul 20, 2026Updated last week
kooBH / DSS
View on GitHub
[WIP]Direction based Multi-Channel Speech Separation
☆14Jan 25, 2024Updated 2 years ago
JusperLee / SonicSim
View on GitHub
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
☆277Jan 22, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
Audio-WestlakeU / CleanMel
View on GitHub
Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".
☆94Feb 2, 2026Updated 5 months ago
Audio-WestlakeU / Mel-McNet
View on GitHub
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
☆26May 14, 2026Updated 2 months ago
egruttadauria98 / SSpaVAlDo
View on GitHub
☆37Jan 6, 2026Updated 6 months ago
smulelabs / windowed-roformer
View on GitHub
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45Oct 30, 2025Updated 8 months ago
ASLP-lab / Smart-Glass-Challenge
View on GitHub
☆18Jun 16, 2026Updated last month
RookieJunChen / FullSubNet-plus
View on GitHub
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
☆293Jul 26, 2025Updated last year