alibabasglab / MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
☆89, updated 2 months ago
Alternatives and similar repositories for MossFormer:
Users who are interested in MossFormer are comparing it to the repositories listed below:
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. (☆86, updated 5 months ago)
- Evaluation and Benchmarking of Speech Super-resolution Methods (☆144, updated 2 years ago)
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023. (☆95, updated last year)
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement (☆155, updated 2 years ago)
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain) (☆61, updated 2 years ago)
- Source code for Consistent ensemble distillation for audio tagging (☆23, updated 7 months ago)
- Huawei Grad-TTS for Chinese (☆46, updated last year)
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain. (☆89, updated last year)
- Target Speaker Extraction Toolkit (☆144, updated last week)
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 (☆51, updated last year)
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT (☆53, updated last year)
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation (☆205, updated 5 months ago)
- Official Repository For VoxBlink2 (☆60, updated 6 months ago)
- Official repository of NeXt-TDNN for speaker verification (☆65, updated 4 months ago)
- Official repository of SepReformer for speech separation (☆172, updated last month)
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec (☆94, updated 3 weeks ago)
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation (☆84, updated 10 months ago)
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion (☆146, updated last year)
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification" (☆51, updated last week)
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer (☆75, updated last year)
- Speech Separation (☆60, updated 11 months ago)
- Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement (☆203, updated 2 years ago)
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO (☆60, updated 2 years ago)
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion (☆140, updated last year)