☆16May 26, 2022Updated 4 years ago
Alternatives and similar repositories for D4AM
Users that are interested in D4AM are comparing it to the libraries listed below.
- Code for our paper "Auxiliary Task Reweighting for Minimum-data Learning" (NeurIPS 2020)☆18Dec 21, 2020Updated 5 years ago
- This is the official implementation of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆93May 26, 2025Updated last year
- Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"☆21May 24, 2023Updated 3 years ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆93Feb 2, 2026Updated 4 months ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆46Sep 12, 2024Updated last year
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆269Dec 12, 2025Updated 6 months ago
- Pytorch implementation of paper "High Fidelity Speech Regeneration With Application to Speech Enhancement"☆15May 8, 2021Updated 5 years ago
- ☆28Jun 1, 2023Updated 3 years ago
- Code to train a custom time-domain autoencoder to dereverb audio☆16Nov 30, 2023Updated 2 years ago
- ☆68Aug 16, 2023Updated 2 years ago
- ☆21Jul 15, 2024Updated last year
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆24Jun 18, 2025Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆56Mar 5, 2025Updated last year
- ☆31Jan 9, 2024Updated 2 years ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆28Nov 12, 2025Updated 7 months ago
- A neural speech codec based on discrete WavLM representations☆26Aug 28, 2024Updated last year
- ☆135Apr 24, 2023Updated 3 years ago
- Fréchet Gesture Distance from (Yoon et al.) exploration and eventual improvement☆21Mar 10, 2023Updated 3 years ago
- VIUNet: Deep Visual–Inertial–UWB Fusion for Indoor UAV Localization (IEEE ACCESS'23)☆19Aug 23, 2023Updated 2 years ago
- ☆15Sep 16, 2024Updated last year
- Official code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Code for "Superpixel Segmentation via Convolutional Neural Networks with Regularized Information Maximization", ICASSP2020☆21Jun 22, 2020Updated 5 years ago
- A pytorch implementation of D3Net.☆11Aug 8, 2021Updated 4 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Jun 23, 2022Updated 3 years ago
- ☆10Sep 25, 2024Updated last year
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆22Sep 21, 2021Updated 4 years ago
- Streaming Audiotransformers for online Audio tagging☆56Jun 14, 2024Updated 2 years ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆73May 11, 2024Updated 2 years ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- A Singing Style Conversion Framework Based On Audio Infilling☆35Apr 28, 2025Updated last year
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- Andes DSP Library☆21May 21, 2026Updated 3 weeks ago
- Official Implementation of SERIL in Pytorch☆27Sep 29, 2020Updated 5 years ago
- This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea…☆11Sep 27, 2022Updated 3 years ago
- ☆12Nov 16, 2020Updated 5 years ago
- Official implementation of "Clustering as Attention: Unified Image Segmentation with Hierarchical Clustering"☆32Jun 16, 2022Updated 4 years ago
- Pytorch implementation of MDensenet and sparse NMF. Made for my undergraduate thesis "Music Source Separation with Supervised Learning Me…☆11Jan 31, 2021Updated 5 years ago
- Pytorch implementation of the paper: A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- This repo contains required files for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.☆92Feb 13, 2026Updated 4 months ago