alibabasglab / MossFormerLinks
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
☆95Updated 9 months ago
Alternatives and similar repositories for MossFormer
Users that are interested in MossFormer are comparing it to the libraries listed below
Sorting:
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆152Updated 3 years ago
- ☆160Updated 9 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆156Updated 3 years ago
- ☆181Updated 9 months ago
- Phase-aware speech enchancement with Deep Complex U-Net☆127Updated 2 years ago
- Official repository of NeXt-TDNN for speaker verification☆78Updated 11 months ago
- Target Speaker Extraction Toolkit☆197Updated last month
- Predicts the level of noise and reverberation on your audiofiles☆163Updated 3 months ago
- Expressive Anechoic Recordings of Speech (EARS)☆192Updated last year
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆261Updated 2 months ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆149Updated last year
- ☆92Updated 11 months ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆94Updated last month
- ☆68Updated last year
- ☆62Updated 2 years ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆68Updated last year
- This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆69Updated 3 months ago
- Huawei Grad-TTS for Chinese☆51Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆152Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging☆46Updated 3 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆147Updated 3 months ago
- Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.☆107Updated 5 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆106Updated last year
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆68Updated last month
- ☆66Updated 2 years ago
- This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆232Updated 4 months ago
- Speech Separation☆73Updated last year
- ☆56Updated 2 years ago
- Official repository of SepReformer for speech separation☆219Updated 8 months ago