alibabasglab / MossFormerLinks
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
☆100Updated last year
Alternatives and similar repositories for MossFormer
Users that are interested in MossFormer are comparing it to the libraries listed below
Sorting:
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆98Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆152Updated 3 years ago
- ☆166Updated last year
- Phase-aware speech enchancement with Deep Complex U-Net☆132Updated 2 years ago
- ☆68Updated 2 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆158Updated 3 years ago
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆116Updated last year
- Predicts the level of noise and reverberation on your audiofiles☆174Updated 7 months ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆55Updated 2 years ago
- Expressive Anechoic Recordings of Speech (EARS)☆206Updated last year
- Speech Separation☆78Updated last year
- ☆201Updated last year
- ☆107Updated last year
- Source code for Consistent ensemble distillation for audio tagging☆53Updated 7 months ago
- This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆85Updated 7 months ago
- ☆70Updated last year
- ☆61Updated 2 years ago
- Official repository of SepReformer for speech separation☆238Updated last year
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆153Updated 2 years ago
- ☆65Updated 2 years ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆117Updated 5 months ago
- Official Repository For VoxBlink2☆85Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆146Updated 3 years ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆77Updated last week
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆74Updated last year
- Official repository of NeXt-TDNN for speaker verification☆81Updated last year
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆282Updated 5 months ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆64Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging☆49Updated last year
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆148Updated 2 years ago