alibabasglab / MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
☆94Updated 8 months ago
Alternatives and similar repositories for MossFormer
Users who are interested in MossFormer are comparing it to the repositories listed below
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆261Updated 3 weeks ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆117Updated last year
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆155Updated 3 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆151Updated 3 years ago
- Target Speaker Extraction Toolkit☆184Updated 2 weeks ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆63Updated 2 years ago
- Huawei Grad-TTS for Chinese☆51Updated last year
- ☆68Updated 10 months ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆148Updated last year
- ☆140Updated last year
- ☆61Updated 2 years ago
- Official repository of NeXt-TDNN for speaker verification☆75Updated 10 months ago
- ☆157Updated 8 months ago
- ☆176Updated 8 months ago
- ☆88Updated 10 months ago
- ConMamba for Automatic Speech Recognition☆80Updated 11 months ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆89Updated 2 months ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆132Updated last year
- Official Repository For VoxBlink2☆76Updated 11 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆94Updated 8 months ago
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆114Updated last year
- Predicts the level of noise and reverberation in your audio files☆156Updated last month
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆63Updated last year
- ☆55Updated 2 years ago
- Expressive Anechoic Recordings of Speech (EARS)☆181Updated last year
- Phase-aware speech enhancement with Deep Complex U-Net☆123Updated 2 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆147Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging☆43Updated last month
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆54Updated 2 years ago