alibabasglab / MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
☆99Updated last year
Alternatives and similar repositories for MossFormer
Users who are interested in MossFormer are comparing it to the repositories listed below
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆266Updated 4 months ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆152Updated 3 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆97Updated 2 years ago
- ☆194Updated last year
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆158Updated 3 years ago
- Speech Separation☆78Updated last year
- ☆69Updated last year
- ☆69Updated 2 years ago
- ☆65Updated 2 years ago
- ☆166Updated last year
- Official Repository For VoxBlink2☆85Updated last year
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Updated last year
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆115Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆55Updated 2 years ago
- Unofficial SoundStream implementation in PyTorch with training code and 16kHz pretrained checkpoint☆76Updated 2 years ago
- This GitHub repo is for NeurIPS 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆105Updated 2 years ago
- Predicts the level of noise and reverberation in your audio files☆171Updated 5 months ago
- Expressive Anechoic Recordings of Speech (EARS)☆200Updated last year
- Official repository of NeXt-TDNN for speaker verification☆80Updated last year
- Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.☆125Updated 2 years ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆64Updated 3 years ago
- Phase-aware speech enhancement with Deep Complex U-Net☆132Updated 2 years ago
- Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS☆167Updated last year
- Huawei Grad-TTS for Chinese☆50Updated 2 years ago
- ☆103Updated last year
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆152Updated 2 years ago
- Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆133Updated 2 years ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆106Updated 4 months ago
- Streaming Audio Transformers for online audio tagging☆49Updated last year