tennisonliu / noise_reductionView external linksLinks
Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition
☆25Dec 10, 2018Updated 7 years ago
Alternatives and similar repositories for noise_reduction
Users that are interested in noise_reduction are comparing it to the libraries listed below
Sorting:
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- ☆16Dec 31, 2021Updated 4 years ago
- A PyTorch implementation of the Modified Discrete Cosine Transform (MDCT) and its inverse for audio processing.☆32Dec 17, 2024Updated last year
- ☆10Apr 8, 2024Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 8 months ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features☆55Oct 11, 2021Updated 4 years ago
- GPT for FACodec☆13Mar 25, 2024Updated last year
- Lightweight knowledge distillation pipeline☆28Nov 29, 2021Updated 4 years ago
- Codebase and project page for EDMSound☆35Nov 20, 2023Updated 2 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- MelNet-Tensorflow implementation☆40Dec 1, 2020Updated 5 years ago
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆49Sep 2, 2025Updated 5 months ago
- ☆25Jun 19, 2025Updated 7 months ago
- Chat data cleaning, filtering and deduplication pipeline.☆21Jul 25, 2023Updated 2 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Jul 24, 2024Updated last year
- Basic wavenet and fftnet vocoder model.☆19Feb 7, 2022Updated 4 years ago
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆82Feb 9, 2021Updated 5 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- ☆18Feb 9, 2020Updated 6 years ago
- ☆17Aug 27, 2025Updated 5 months ago
- An imporved version of Fastsinging singing voice synthesising system.☆20Nov 3, 2020Updated 5 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆87Dec 20, 2022Updated 3 years ago
- Copy objects from real life and directly paste them on a background image using only your phone's camera☆23Updated this week
- Kaggle Seizure Prediction Competition☆20Nov 29, 2016Updated 9 years ago
- Official implementation of OSSGAN [CVPR 2022]☆21May 2, 2022Updated 3 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- A Neural Audio Codec (NAC) for Universal Audio☆44May 30, 2025Updated 8 months ago
- Temporary anonymous version☆22Mar 20, 2024Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ☆55Jan 13, 2023Updated 3 years ago
- Multi-GPU training using Keras with a Tensorflow backend.☆20Jul 8, 2017Updated 8 years ago
- ☆25Jan 24, 2023Updated 3 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- A trainer for SNAC (Multi-Scale Neural Audio Codec) has replaced the decoder with Vocos.☆66Oct 28, 2024Updated last year
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 2 years ago
- ☆25Mar 12, 2022Updated 3 years ago