heungky / trainable-STFT-Mel
Understanding Audio Features via Trainable Basis Functions
☆9Updated 3 years ago
Alternatives and similar repositories for trainable-STFT-Mel:
Users that are interested in trainable-STFT-Mel are comparing it to the libraries listed below
- ☆16Updated last year
- ☆13Updated 2 years ago
- Whisper Speech Quality Assessment (WhiSQA)☆9Updated 5 months ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆18Updated 2 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- ☆16Updated 4 years ago
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks☆9Updated 2 months ago
- ☆26Updated last year
- ☆17Updated 9 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- Implementation of Sheffield entry for Clarity enhancement challenge.☆17Updated 3 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆41Updated 2 years ago
- Prediction of sound event bounding boxes (SEBBs)☆26Updated 9 months ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated last year
- ☆13Updated 4 months ago
- ☆13Updated last month
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- A small tool to calculate the distribution of audio durations in a directory☆14Updated 2 years ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆3Updated last month
- Spherical residual vector quantization (SRVQ)☆28Updated 8 months ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆14Updated 3 years ago
- ☆16Updated last year
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- ☆21Updated last year
- real-time speech enhance☆15Updated last year
- ☆11Updated 9 months ago
- ☆18Updated 5 years ago
- ☆25Updated last year
- The source code of Tim-TSENet☆12Updated 3 years ago
- Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020☆16Updated 4 years ago