A PyTorch implementation of D3Net.
☆11Aug 8, 2021Updated 4 years ago
Alternatives and similar repositories for D3Net
Users who are interested in D3Net are comparing it to the repositories listed below.
- GoLU, a novel, self-gated and element-wise activation function that performs well over a diverse set of tasks☆24Oct 4, 2025Updated 4 months ago
- Official repository for the paper "xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement" (Accepted to INTERSPEECH 2025)☆57Aug 28, 2025Updated 5 months ago
- ☆18Mar 10, 2023Updated 2 years ago
- ☆21Jul 16, 2025Updated 7 months ago
- ☆57Apr 24, 2024Updated last year
- ☆25Feb 28, 2023Updated 2 years ago
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆29Feb 26, 2023Updated 3 years ago
- Deep Noise Suppression for Real Time Speech Enhancement in a Single Channel Wide Band Scenario☆27Jan 25, 2024Updated 2 years ago
- PyTorch implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement"☆28Jan 31, 2022Updated 4 years ago
- Python implementation of OMLSA+IMCRA algorithm for speech enhancement.☆68Jun 29, 2021Updated 4 years ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆31May 22, 2025Updated 9 months ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆151Apr 29, 2025Updated 9 months ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆15Jan 6, 2025Updated last year
- Deep model with built-in self-attention alignment for acoustic echo cancellation, PyTorch implementation☆39Jul 25, 2023Updated 2 years ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement☆46Sep 12, 2024Updated last year
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- Official repo for ICCV 2025 paper "Is Less More? Exploring Token Condensation as Training-free Test-time Adaptation"☆17Sep 3, 2025Updated 5 months ago
- ☆98Apr 29, 2021Updated 4 years ago
- This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…☆46Sep 6, 2023Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2021 challenge☆42Jun 15, 2021Updated 4 years ago
- Poet: Product-oriented Video Captioner for E-commerce☆12Sep 21, 2020Updated 5 years ago
- ☆13Nov 22, 2022Updated 3 years ago
- HippoMM: Hippocampal-inspired Multimodal Memory☆15May 22, 2025Updated 9 months ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 3 months ago
- ☆10Aug 3, 2020Updated 5 years ago
- Details of the datasets for Few-shot class-incremental audio classification☆11Dec 6, 2023Updated 2 years ago
- Andes DSP Library☆18Dec 15, 2025Updated 2 months ago
- Adaptive Adjustment of Noise Covariance in Kalman Filter for Dynamic State Estimation☆12Nov 21, 2023Updated 2 years ago
- YSC 2023 Papers: A complete collection of research papers, code and data from the International Young Scientists Conference 2023 for youn…☆12Jan 17, 2024Updated 2 years ago
- Bluetooth low-complexity, subband codec (SBC) library☆14Aug 16, 2025Updated 6 months ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- [ICASSP 2023] This repository includes the official project of C2FVL, presented in our paper: COARSE-TO-FINE COVID-19 SEGMENTATION VIA VI…☆12Sep 18, 2025Updated 5 months ago
- Official code for Dense-TSNet☆12Sep 17, 2024Updated last year
- ☆10Jul 16, 2024Updated last year
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- ☆14Jun 11, 2025Updated 8 months ago
- Deep Visual Speech Recognition in Arabic words☆16Oct 18, 2023Updated 2 years ago