sithu31296 / audio-taggingLinks
Easy to use Audio Tagging in PyTorch
☆22Updated 3 years ago
Alternatives and similar repositories for audio-tagging
Users that are interested in audio-tagging are comparing it to the libraries listed below
Sorting:
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆54Updated 2 years ago
- ☆65Updated 2 years ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆48Updated 9 months ago
- ☆25Updated 8 months ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆75Updated last month
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Updated 9 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆45Updated 4 months ago
- ☆13Updated 2 years ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆3Updated 4 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ☆30Updated 4 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆33Updated 9 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆90Updated last year
- CDER (Conversational Diarization Error Rate) Scoring Tool☆21Updated 2 years ago
- Speech Dereverberation using Fully Convolutional Networks☆72Updated 4 years ago
- Conformer-based Metric GAN for speech enhancement☆26Updated last year
- ☆35Updated 2 months ago
- ☆85Updated last year
- Paderborn Sound Event Detection☆74Updated last year
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Discriminative Training of VBx Diarization☆25Updated 9 months ago
- Streaming Audiotransformers for online Audio tagging☆45Updated last year
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆41Updated 3 years ago
- PyTorch implementation of LiMuSE☆31Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆40Updated 2 years ago
- ☆13Updated last year
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆60Updated 9 months ago
- ☆22Updated 2 years ago