heungky / trainable-STFT-Mel
Understanding Audio Features via Trainable Basis Functions
☆9Updated 2 years ago
Alternatives and similar repositories for trainable-STFT-Mel:
Users that are interested in trainable-STFT-Mel are comparing it to the libraries listed below
- Whisper Speech Quality Assessment (WhiSQA)☆9Updated 4 months ago
- ☆16Updated 8 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- Prediction of sound event bounding boxes (SEBBs)☆26Updated 8 months ago
- ☆11Updated 8 months ago
- ☆13Updated 2 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Updated 2 years ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated last year
- ☆16Updated 4 years ago
- Implementation of Sheffield entry for Clarity enhancement challenge.☆17Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆2Updated 3 weeks ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆30Updated 3 months ago
- ☆10Updated 2 years ago
- ☆13Updated 3 months ago
- Addressing the confounds of accompaniments in singer identification☆18Updated 5 years ago
- A benchmark for evaluating audio encoders on various audio tasks.☆13Updated this week
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆25Updated last year
- ☆10Updated 6 months ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Updated 6 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆18Updated 2 years ago
- US-based professors who work on audio. For students who would like to apply for RA, PhD, postdoc in audio research.☆25Updated last week
- ☆21Updated last year
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆13Updated 3 years ago
- acnn for text-independent speaker recognition☆9Updated 3 years ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Updated 2 years ago
- ☆28Updated 10 months ago
- Speech enhancement using mimic loss☆16Updated 5 years ago