chenchy/D3Net

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chenchy/D3Net)

chenchy / D3Net

A pytorch implementation of D3Net.

☆11

Alternatives and similar repositories for D3Net

Users that are interested in D3Net are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JinjiangLiu / ICCRN
View on GitHub
☆18Mar 10, 2023Updated 3 years ago
Qinwen-Hu / SDCM
View on GitHub
☆25Feb 28, 2023Updated 3 years ago
JusperLee / TFACM
View on GitHub
☆23Jul 16, 2025Updated last year
wangtianrui / APC-SNR
View on GitHub
Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch
☆28Jan 31, 2022Updated 4 years ago
gitwukeyi / FSPEN
View on GitHub
☆59Apr 24, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
aleXiehta / Causal-SE
View on GitHub
Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"
☆28Feb 26, 2023Updated 3 years ago
eagomez2 / upf-smc-speech-enhancement-thesis
View on GitHub
Deep Noise Suppression for Real Time Speech Enhancement in a Single Channel Wide Band Scenario
☆27Jan 25, 2024Updated 2 years ago
shengyuzhang / Poet
View on GitHub
Poet: Product-oriented Video Captioner for E-commerce
☆12Sep 21, 2020Updated 5 years ago
google / df-conformer
View on GitHub
Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.
☆36Jun 23, 2026Updated 3 weeks ago
shengyuzhang / VideoTitling
View on GitHub
Comprehensive Information Integration Modeling Framework for Video Titling
☆11Aug 27, 2020Updated 5 years ago
sharathadavanne / seld-dcase2021
View on GitHub
Baseline method for sound event localization task of DCASE 2021 challenge
☆45Jun 15, 2021Updated 5 years ago
vBaiCai / vc_tacotron
View on GitHub
Voice Conversion using Tacotron.
☆11Dec 29, 2022Updated 3 years ago
adrianbarahona / conditional_wavegan_knocking_sounds
View on GitHub
Keras implementation of conditional waveGAN. Application to knocking sound effects with emotion.
☆10Jun 22, 2020Updated 6 years ago
shawntan / neural-transducers
View on GitHub
An attempt at implementing ideas in "Learning to Transduce with Unbounded Memory" (http://arxiv.org/abs/1506.02516)
☆11Jul 27, 2016Updated 9 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
NikolaiKyhne / xLSTM-SENet
View on GitHub
Official repository for the paper "xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement" (Accepted to INTERSPEECH 2025)
☆60Aug 28, 2025Updated 10 months ago
Kizuna-AII / Realm-Before-the-Omniscience
View on GitHub
Project for ZJU-Game-2021
☆10Sep 20, 2021Updated 4 years ago
huaidanquede / Dense-TSNet
View on GitHub
offical code for Dense-TSNet
☆12Sep 17, 2024Updated last year
yuzhouhe2000 / OMLSA-IMCRA
View on GitHub
Python implementation of OMLSA+IMCRA algorithm for speech enhancement.
☆70Jun 29, 2021Updated 5 years ago
kimsunwiub / BLOOM-Net
View on GitHub
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
☆14Feb 13, 2022Updated 4 years ago
JohnnieXDU / LISTA-based-KSVD-for-Sparse-Coding
View on GitHub
520.648 project: OMP/KSVD/ISTA/FISTA/LISTA for image inpainting and denoising
☆14Sep 20, 2020Updated 5 years ago
WelkinYang / EMPHASIS-pytorch
View on GitHub
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
☆15Mar 31, 2019Updated 7 years ago
lhwcv / self_attention_alignment
View on GitHub
Deep model with built-in self-attention alignment for acoustic echo cancellation, Pytorch implement
☆39Jul 25, 2023Updated 2 years ago
Hell-en / AEKF
View on GitHub
Adaptive Adjustment of Noise Covariance in Kalman Filter for Dynamic State Estimation
☆15Nov 21, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NYElegance / SimulLR
View on GitHub
PyTorch Implementation of SimulLR
☆11Dec 30, 2021Updated 4 years ago
cnlinxi / tpse_tacotron2
View on GitHub
TPSE-GST Tacotron2
☆14May 1, 2019Updated 7 years ago
mutiann / neural-lexicon-reader
View on GitHub
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Jul 25, 2022Updated 3 years ago
enhancer12 / TSPNN
View on GitHub
Two-stage progressive neural network for acoustic echo cancellation
☆54May 22, 2023Updated 3 years ago
automl / GoLU
View on GitHub
GoLU, a novel, self-gated and element-wise activation function that performs well over a diverse set of tasks
☆27Oct 4, 2025Updated 9 months ago
Xiaobin-Rong / lite-rtse
View on GitHub
An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement
☆14Nov 19, 2023Updated 2 years ago
Audio-WestlakeU / RealMAN
View on GitHub
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…
☆175Apr 29, 2025Updated last year
zyy-fc / CGMM-MVDR
View on GitHub
☆10Aug 3, 2020Updated 5 years ago
caoruitju / RUI_SE
View on GitHub
VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement
☆46Sep 12, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
GT-KIM / specmix
View on GitHub
This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea…
☆11Sep 27, 2022Updated 3 years ago
MengboLi / MS-SENet
View on GitHub
☆11Jul 16, 2024Updated 2 years ago
nefarius / libsbc
View on GitHub
Bluetooth low-complexity, subband codec (SBC) library
☆15Aug 16, 2025Updated 11 months ago
elis2496 / maxup_implementation
View on GitHub
☆12Nov 16, 2020Updated 5 years ago
fakufaku / create_wsj1_2345_db
View on GitHub
Collection of scripts to create a dataset of noisy multi-channel reverberant mixtures based on wsj1 and CHiME3 datasets.
☆15Dec 6, 2021Updated 4 years ago
alibabasglab / D2Former
View on GitHub
This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…
☆46Sep 6, 2023Updated 2 years ago
metu-sparg / higrid
View on GitHub
Hiearchical Grid Refinement (HiGRID): DOA Estimation using Rigid Spherical Microphone Arrays
☆14Apr 11, 2019Updated 7 years ago