xinliu9451 / awesome-denoiserView external linksLinks
This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very friendly for beginners.
☆50Jan 26, 2026Updated 3 weeks ago
Alternatives and similar repositories for awesome-denoiser
Users that are interested in awesome-denoiser are comparing it to the libraries listed below
Sorting:
- silero-vad pytorch implement☆34Nov 23, 2024Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 10 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Aug 16, 2024Updated last year
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- ☆14May 26, 2022Updated 3 years ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆89Feb 2, 2026Updated 2 weeks ago
- Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting"☆109Oct 16, 2025Updated 4 months ago
- 一个简单的音频降噪工具,提高web UI界面和api接口☆44Nov 21, 2024Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆24Apr 12, 2024Updated last year
- データ分析コンペにAIエージェントを活用したい!☆31Aug 20, 2025Updated 5 months ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 10 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- finetune your florence2 model easy☆21Jul 27, 2024Updated last year
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆147May 18, 2025Updated 8 months ago
- A training code template for DNN-based speech enhancement.☆166Sep 4, 2025Updated 5 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆54Mar 5, 2025Updated 11 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 9 months ago
- ☆23Oct 19, 2024Updated last year
- ☆23Oct 17, 2024Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 4 months ago
- A toolkit dedicate for speech evaluation.☆24Sep 26, 2024Updated last year
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- A neural speech codec based on discrete WavLM representations☆24Aug 28, 2024Updated last year
- Fine-Tune Whisper with Transformers and PEFT☆58Nov 4, 2023Updated 2 years ago
- ☆37Sep 21, 2025Updated 4 months ago
- ☆27Nov 4, 2024Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET)☆132Oct 2, 2025Updated 4 months ago
- The official implementation of GTCRN, an ultra-lightweight SE model.☆561Jan 18, 2026Updated 3 weeks ago
- ☆31Jun 6, 2023Updated 2 years ago
- faster inference☆28Jan 20, 2025Updated last year
- 💠 Aivis: AI Voice Imitation System☆27Feb 25, 2024Updated last year
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆30Apr 26, 2024Updated last year
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆37Feb 24, 2025Updated 11 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- ☆37Jul 15, 2025Updated 7 months ago
- ☆31Aug 23, 2022Updated 3 years ago
- FPGA based, Real-time processing of audio, including voiceprint recognition, adaptive noise suppression, et al.☆15May 8, 2025Updated 9 months ago
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago