alibabasglab / MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
☆90 Updated 3 months ago
Alternatives and similar repositories for MossFormer:
Users who are interested in MossFormer are comparing it to the repositories listed below.
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023. ☆95 Updated last year
- Evaluation and Benchmarking of Speech Super-resolution Methods ☆148 Updated 2 years ago
- Target Speaker Extraction Toolkit ☆147 Updated last week
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement ☆155 Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging ☆26 Updated 7 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. ☆91 Updated 6 months ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain) ☆60 Updated 2 years ago
- iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform ☆242 Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆122 Updated 2 years ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion ☆143 Updated last year
- Huawei Grad-TTS for Chinese ☆46 Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆51 Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT ☆53 Updated last year
- Predicts the level of noise and reverberation on your audiofiles ☆144 Updated 9 months ago
- Official repository of NeXt-TDNN for speaker verification ☆67 Updated 5 months ago
- Speech Separation ☆62 Updated last year
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022) ☆115 Updated last year
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation ☆209 Updated 6 months ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion ☆145 Updated last year
- The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based … ☆120 Updated last week
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth… ☆84 Updated 2 years ago
- TransferTTS (Zero-Shot learning of VITS) ☆94 Updated 2 years ago
- Official implementation of SpeechSplit2 ☆132 Updated 2 years ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement" ☆116 Updated 2 years ago