jeffreyjeffreywang / SSE
Self-supervised Speech Enhancement network
☆11 · Aug 27, 2020 · Updated 5 years ago
Alternatives and similar repositories for SSE
Users that are interested in SSE are comparing it to the libraries listed below
- Removing various types of noises present in the speech using Deep Neural Networks ☆30 · Apr 17, 2021 · Updated 4 years ago
- Tacotron2 with BERT examples ☆10 · Jul 8, 2019 · Updated 6 years ago
- An implementation of Neural Style Transfer for Audio using Pytorch. ☆10 · Dec 14, 2017 · Updated 8 years ago
- Homemade LightGBM and VGG-net experiment setup for DCASE2017 task 1 ☆11 · Aug 8, 2017 · Updated 8 years ago
- The code for our work ☆18 · Apr 7, 2024 · Updated last year
- ☆10 · Jan 18, 2024 · Updated 2 years ago
- Adaptive front ends ☆15 · Oct 1, 2018 · Updated 7 years ago
- This repository contains the video files (download links) and corresponding annotations used in the paper "Long-Term Face Tracking for Cr… ☆14 · Dec 18, 2020 · Updated 5 years ago
- Sharing valuable knowledge about TCP/IP. ☆10 · Sep 13, 2021 · Updated 4 years ago
- The implementation of MDNet, which is in submission to Interspeech2022 ☆14 · May 1, 2022 · Updated 3 years ago
- An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement ☆13 · Jul 17, 2023 · Updated 2 years ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023 ☆12 · Mar 17, 2024 · Updated last year
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement ☆10 · Jan 24, 2022 · Updated 4 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement ☆47 · Jun 23, 2022 · Updated 3 years ago
- Methods used in the paper "Plausible Uncertainties for Human Pose Regression". ☆14 · Aug 13, 2024 · Updated last year
- 【ICCV 2023】Towards Instance-adaptive Inference for Federated Learning ☆13 · Mar 31, 2025 · Updated 10 months ago
- recent audio generation papers (including speech, music and general audios) ☆13 · Mar 14, 2023 · Updated 2 years ago
- ☆11 · Mar 15, 2017 · Updated 8 years ago
- A collection of trending speech enhancement papers ☆11 · Dec 4, 2020 · Updated 5 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasets ☆14 · Nov 9, 2021 · Updated 4 years ago
- Hierarchical Grid Refinement (HiGRID): DOA Estimation using Rigid Spherical Microphone Arrays ☆12 · Apr 11, 2019 · Updated 6 years ago
- ☆16 · Nov 25, 2024 · Updated last year
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset ☆15 · Apr 7, 2025 · Updated 10 months ago
- Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA) ☆13 · Feb 27, 2021 · Updated 4 years ago
- CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition ☆12 · Oct 7, 2019 · Updated 6 years ago
- Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation ☆22 · Nov 4, 2025 · Updated 3 months ago
- Template for creating audio encoders compatible with X-ARES ☆19 · Dec 8, 2025 · Updated 2 months ago
- ☆13 · Jul 3, 2025 · Updated 7 months ago
- Self-Supervised Dataset Distillation for Transfer Learning ☆16 · Apr 10, 2024 · Updated last year
- ☆12 · Apr 18, 2025 · Updated 9 months ago
- ☆12 · Jun 10, 2021 · Updated 4 years ago
- Official baseline for ICASSP 2026 URGENT Challenge Track 2 (Speech Quality Assessment) ☆25 · Jan 8, 2026 · Updated last month
- Training with Product Digital Twins for AutoRetail Checkout ☆18 · Aug 29, 2023 · Updated 2 years ago
- A living document for all things Common Voice. ☆14 · Jun 24, 2024 · Updated last year
- Applying the Wave-U-Net network to speech enhancement ☆14 · May 29, 2020 · Updated 5 years ago
- ☆15 · Jun 15, 2022 · Updated 3 years ago
- Repository for inzva AI Labs Joint Program ☆11 · Sep 21, 2021 · Updated 4 years ago
- Designing efficient architectures for modeling temporal features with convolutional neural networks ☆16 · Mar 17, 2017 · Updated 8 years ago
- Code for the ICML 2025 paper "SelfCite Self-Supervised Alignment for Context Attribution in Large Language Models" ☆23 · Feb 5, 2026 · Updated last week