unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"
☆15Nov 14, 2023Updated 2 years ago
Alternatives and similar repositories for CPTNN
Users that are interested in CPTNN are comparing it to the libraries listed below
Sorting:
- This is the unofficial implementation of MFNet, from paper''a Mask Free Neural Network for Monaural Speech Enhancement''☆13Dec 20, 2024Updated last year
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆28Sep 16, 2023Updated 2 years ago
- Implementation of Sheffield entry for Clarity enhancement challenge.☆18Apr 19, 2022Updated 3 years ago
- ☆20Apr 27, 2024Updated last year
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆54Mar 5, 2025Updated last year
- ☆32May 17, 2024Updated last year
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆24Nov 12, 2025Updated 3 months ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 4 months ago
- Unofficial Pytorch Lightning Implementation of "A New Framework for CNN-Based Speech Enhancement in the Time Domain"☆24May 9, 2023Updated 2 years ago
- ☆21Jul 15, 2024Updated last year
- microphone array speech generator (MASG) in room acoustic☆39Jan 2, 2020Updated 6 years ago
- Official repository for LMFCA-Net: A Lightweight Model for Multi-Channel Speech Enhancement with Efficient Narrow-Band and Cross-Band Att…☆29Feb 26, 2025Updated last year
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"☆44Jul 10, 2024Updated last year
- ☆12Jun 17, 2019Updated 6 years ago
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆24Dec 12, 2024Updated last year
- Pytorch Models for Speech Enhancement☆23Mar 31, 2023Updated 2 years ago
- ☆25Feb 28, 2023Updated 3 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea…☆11Sep 27, 2022Updated 3 years ago
- ☆11Jun 6, 2022Updated 3 years ago
- ☆33Nov 18, 2025Updated 3 months ago
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Dec 15, 2022Updated 3 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 11 months ago
- ☆11Nov 7, 2024Updated last year
- A neural speech codec based on discrete WavLM representations☆24Aug 28, 2024Updated last year
- Coarse implement of the paper "A Simultaneous Denoising and Dereverberation Framework with Target Decoupling", On DNS-2020 dataset, the D…☆64Jan 8, 2022Updated 4 years ago
- ☆52Sep 10, 2024Updated last year
- ☆13Oct 11, 2024Updated last year
- Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)☆17Feb 25, 2026Updated last week
- ☆11Mar 22, 2023Updated 2 years ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Jul 11, 2025Updated 7 months ago
- modules for the evaluation of acoustic echo cancellation systems☆17Nov 2, 2021Updated 4 years ago
- ☆16Jun 15, 2022Updated 3 years ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆13Oct 31, 2024Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆28Sep 20, 2025Updated 5 months ago