tennisonliu / noise_reduction
Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition
☆23Updated 6 years ago
Alternatives and similar repositories for noise_reduction:
Users that are interested in noise_reduction are comparing it to the libraries listed below
- Simple baseline model for the HEAR benchmark☆23Updated last month
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)☆22Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Paderbox: A collection of utilities for audio / speech processing☆38Updated 8 months ago
- Da - ECHO - RetrievAl - daTasEt☆25Updated 7 months ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- ☆32Updated 4 years ago
- ☆15Updated 2 years ago
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]☆33Updated 4 months ago
- ☆23Updated 2 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆18Updated 2 years ago
- ☆32Updated 3 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆22Updated 2 years ago
- A C++/Cython audio limiter for Python.☆25Updated 2 years ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enha…☆73Updated last month
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Updated 10 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆69Updated 2 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆42Updated 2 years ago
- Deep Speech Distances PyTorch☆27Updated 3 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- Frechet Audio Distance evaluation in PyTorch☆36Updated last year
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 2 years ago
- End-to-End SpeechSynthesis system with fastspeech2 & hifigan☆13Updated 2 years ago