tennisonliu / noise_reduction
Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition
☆24Updated 6 years ago
Alternatives and similar repositories for noise_reduction:
Users that are interested in noise_reduction are comparing it to the libraries listed below
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 8 months ago
- Paderbox: A collection of utilities for audio / speech processing☆38Updated last month
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Clustering-based methods for overlapping diarization☆77Updated last year
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- This code is to run the WARP-Q speech quality metric.☆35Updated 5 months ago
- ☆31Updated 11 months ago
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.☆41Updated 3 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆11Updated 2 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆29Updated 9 months ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆82Updated last year
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 4 years ago
- ☆32Updated 3 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆42Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆43Updated 9 months ago
- Constrained Permutation Invariant Training, Speech Separation☆47Updated 4 years ago
- Python library for audio augmentation☆83Updated last year
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated 2 years ago
- PyTorch based speaker embedding model☆15Updated 11 months ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Updated 2 years ago
- ☆23Updated 2 years ago
- Official PyTorch implementation of TTS Style Transfer☆23Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆95Updated 3 years ago