heungky / trainable-STFT-MelLinks

Understanding Audio Features via Trainable Basis Functions

☆9

Alternatives and similar repositories for trainable-STFT-Mel

Users that are interested in trainable-STFT-Mel are comparing it to the libraries listed below

Sorting:

IU-SAIGE / pse
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆21Updated 2 years ago
leto19 / WhiSQA
Whisper Speech Quality Assessment (WhiSQA)
☆10Updated 7 months ago
dhimasryan / TMHINT-QI-VoiceMOS2023
☆17Updated last year
apple-yinhan / Noise-robust-SED
☆13Updated 6 months ago
lab-emi / CleanUMamba
CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning [Official PyTorch implementation]
☆12Updated last month
fgnt / speaker_reassignment
Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
☆12Updated 5 months ago
haoheliu / ontology-aware-audio-tagging
☆13Updated 2 years ago
Taltt / FNSE-SBGAN
FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks
☆13Updated 2 months ago
Honee-W / CPTNN
unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"
☆15Updated last year
tuanio / nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆11Updated 2 years ago
onolab-tmu / libss
A Python library for blind source separation.
☆4Updated 3 months ago
TuZehai / Sheffield_Clarity_CEC1_Entry
Implementation of Sheffield entry for Clarity enhancement challenge.
☆17Updated 3 years ago
merlresearch / sebbs
Prediction of sound event bounding boxes (SEBBs)
☆29Updated 11 months ago
FrancoisGrondin / kissdsp
☆11Updated 11 months ago
Visitor-W / MTDA
MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection
☆9Updated 9 months ago
Yip-Jia-Qi / codecformer
☆17Updated last year
haoheliu / DCASE_2022_Task_5
System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection
☆28Updated 3 years ago
zjzser / WMCodec
PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…
☆14Updated 8 months ago
chaufanglin / Normal2Whisper
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆11Updated 8 months ago
hmohebbi / disentangling_representations
☆12Updated 9 months ago
guozixunnicolas / DENT_DDSP
☆22Updated 2 years ago
prairie-schooner / wav2vec-vc
☆11Updated 2 years ago
wyw97 / DENSE
ICASSP2025Dynamic Embedding Causal Target Speech Extraction
☆3Updated 4 months ago
popcornell / OSDC
☆16Updated 4 years ago
khanld / Dynamic-Mixing
Dynamic Mixing For Speech Processing (mix-on-the-fly)
☆20Updated 3 years ago
zeroone-universe / RealTimeBWE
Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"
☆35Updated last year
bastibe / MAPS-Scripts
A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.
☆24Updated 4 years ago
BUTSpeechFIT / TS_SUPERB
☆15Updated 3 months ago
yluo42 / SRVQ
Spherical residual vector quantization (SRVQ)
☆30Updated 10 months ago
yongyizang / TrainingFreeMultiStepASR
Official Repository for "Training-Free Multi-Step Audio Source Separation"
☆46Updated last month