Self-supervised Speech Enhancement network
☆11Aug 27, 2020Updated 5 years ago
Alternatives and similar repositories for SSE
Users that are interested in SSE are comparing it to the libraries listed below
Sorting:
- Removing various types of noises present in the speech using Deep Neural Networks☆30Apr 17, 2021Updated 4 years ago
- An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement☆13Jul 17, 2023Updated 2 years ago
- An implementation of Neural Style Transfer for Audio using Pytorch.☆10Dec 14, 2017Updated 8 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- The implementation of MDNet, which is in submission to Interspeech2022☆14May 1, 2022Updated 3 years ago
- This repository contains the video files (download links) and corresponding annotations used in the paper "Long-Term Face Tracking for Cr…☆14Dec 18, 2020Updated 5 years ago
- The code for our work☆18Apr 7, 2024Updated last year
- Sharing valuable knowledge about TCP/IP.☆10Sep 13, 2021Updated 4 years ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated last year
- Homemade LightGBM and VGG-net experiment setup for DCASE2017 task 1☆11Aug 8, 2017Updated 8 years ago
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- Adaptive front ends☆15Oct 1, 2018Updated 7 years ago
- ☆10Jan 18, 2024Updated 2 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Jun 23, 2022Updated 3 years ago
- Decoding of the speech envelope from EEG using the VLAAI deep neural network☆15Sep 28, 2022Updated 3 years ago
- A collection of trending speech enhancement papers☆11Dec 4, 2020Updated 5 years ago
- recent audio generation papers (including speech, music and general audios)☆13Mar 14, 2023Updated 2 years ago
- ☆11Mar 15, 2017Updated 8 years ago
- ☆10Dec 6, 2019Updated 6 years ago
- Methods used in the paper "Plausible Uncertainties for Human Pose Regression".☆14Aug 13, 2024Updated last year
- 【ICCV 2023】Towards Instance-adaptive Inference for Federated Learning☆13Mar 31, 2025Updated 11 months ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated 11 months ago
- Reimplementation of speech decoding 2022 paper by MetaAI☆14Oct 17, 2023Updated 2 years ago
- Hiearchical Grid Refinement (HiGRID): DOA Estimation using Rigid Spherical Microphone Arrays☆12Apr 11, 2019Updated 6 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasets☆15Nov 9, 2021Updated 4 years ago
- ☆16Nov 25, 2024Updated last year
- ☆14Jun 27, 2024Updated last year
- Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)☆13Feb 27, 2021Updated 5 years ago
- ☆13Mar 7, 2022Updated 4 years ago
- A light Python script that can compute accuracy, exact match, precision, recall, f1 score, and Hamming score.☆19Jun 23, 2017Updated 8 years ago
- Code to implement the model of No.2 in Task 1 of the Auditory EEG Challenge (ICASSP 2024)☆12Jan 29, 2024Updated 2 years ago
- Sound classification using neural networks☆12Jun 6, 2018Updated 7 years ago
- CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition☆12Oct 7, 2019Updated 6 years ago
- Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation☆23Nov 4, 2025Updated 4 months ago
- Agile reading group that works☆13Feb 2, 2022Updated 4 years ago
- Fall 2022☆14Nov 29, 2022Updated 3 years ago
- Template for creating audio encoders compatible with X-ARES☆19Feb 11, 2026Updated 3 weeks ago
- ☆13Dec 8, 2022Updated 3 years ago
- Self-Supervised Dataset Distillation for Transfer Learning☆16Apr 10, 2024Updated last year