alibabasglab / MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
☆91Updated 4 months ago
Alternatives and similar repositories for MossFormer:
Users that are interested in MossFormer are comparing it to the libraries listed below
- Evaluation and Benchmarking of Speech Super-resolution Methods☆148Updated 2 years ago
- Target Speaker Extraction Toolkit☆149Updated 3 weeks ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- ☆64Updated 6 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆155Updated 2 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆91Updated 6 months ago
- ☆160Updated 3 months ago
- ☆64Updated last year
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆60Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆84Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging☆27Updated 8 months ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆145Updated last year
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- ☆64Updated last year
- An 16kHz implementation of HiFi-GAN for soft-vc.☆96Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆62Updated 2 years ago
- Official repository of NeXt-TDNN for speaker verification☆69Updated 5 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆85Updated 11 months ago
- ☆50Updated last year
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation☆210Updated 6 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆126Updated 3 weeks ago
- ☆54Updated last year
- ☆144Updated 4 months ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆132Updated last year
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆116Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆122Updated 2 years ago
- Official Implementation of StyleTTS-VC☆177Updated 2 months ago
- A sequence-to-sequence voice conversion toolkit.☆96Updated 8 months ago