sithu31296 / audio-taggingLinks
Easy to use Audio Tagging in PyTorch
☆22Updated 3 years ago
Alternatives and similar repositories for audio-tagging
Users that are interested in audio-tagging are comparing it to the libraries listed below
Sorting:
- Streaming Audiotransformers for online Audio tagging☆44Updated 11 months ago
- ☆13Updated last year
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆25Updated 4 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆74Updated 2 weeks ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- ☆65Updated last year
- Production first, nn-based on-device signal processing toolkit.☆65Updated 2 years ago
- ☆59Updated 4 years ago
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- Paderborn Sound Event Detection☆74Updated last year
- Conformer-based Metric GAN for speech enhancement☆26Updated last year
- PyTorch implementation of LiMuSE☆31Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆41Updated 3 years ago
- ☆14Updated 2 years ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆3Updated 2 months ago
- ☆33Updated 3 weeks ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆58Updated 7 months ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Single channel speech source separation by diffusion process (ICASSP 2023)☆108Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated 2 years ago
- PAM is a no-reference audio quality metric for audio generation tasks☆64Updated 10 months ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆48Updated 8 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆39Updated 6 months ago
- This code is to run the WARP-Q speech quality metric.☆35Updated 7 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆38Updated 8 months ago
- ☆44Updated last year
- ☆21Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ☆69Updated 2 years ago