slp-rl / SC-PhASE
This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (Interspeech 2022)
☆28Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for SC-PhASE
- A Diffusion Probabilistic Model for Target Sound Extraction☆35Updated last month
- ☆27Updated last year
- NOMAD: Non-Matching Audio Distance (ICASSP 2024)☆24Updated last month
- Query-conditioned target sound extraction model☆17Updated 3 weeks ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 3 years ago
- ☆36Updated 5 months ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆69Updated last month
- ☆28Updated last year
- ☆48Updated 5 months ago
- Pytorch implementation of subband decomposition☆89Updated 2 years ago
- Implementation of SpatialCodec.☆55Updated last year
- ☆47Updated last week
- ☆28Updated 6 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆72Updated 2 months ago
- ☆20Updated 10 months ago
- ☆64Updated last year
- ☆34Updated 3 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆49Updated 2 weeks ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆39Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated last month
- Boosting Self-Supervised Embeddings for Speech Enhancement☆45Updated 2 years ago
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 3 weeks ago
- Unsupervised speech enhancement using DVAEs☆19Updated 10 months ago
- This is official repository of new SOTA diffusion models based method for speech enhancement☆33Updated 3 months ago
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation☆23Updated 8 months ago
- Official data preparation scripts for the URGENT 2024 Challenge☆67Updated this week
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"☆23Updated last year
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆31Updated 5 months ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆84Updated 2 months ago
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement☆73Updated 3 years ago