SarthakYadav / fsd50k-pytorchLinks
Unofficial implementation of FSD50k baselines for Sound Event Recognition
☆26Updated last year
Alternatives and similar repositories for fsd50k-pytorch
Users that are interested in fsd50k-pytorch are comparing it to the libraries listed below
Sorting:
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- EVAR ~ Evaluation package for Audio Representations☆62Updated last month
- ☆18Updated 3 years ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆43Updated 2 years ago
- ☆83Updated 2 months ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆47Updated 2 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆126Updated 11 months ago
- experiments about AudioSet☆44Updated 2 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆64Updated 10 months ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆33Updated 4 years ago
- ☆37Updated 2 months ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆88Updated 3 years ago
- Pytorch implementation of subband decomposition☆92Updated 3 years ago
- PyTorch implementation of the LEAF audio frontend☆73Updated 2 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆92Updated 2 years ago
- Pytorch port of Google Research's LEAF Audio paper☆93Updated 4 years ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆20Updated 7 months ago
- Single channel speech source separation by diffusion process (ICASSP 2023)☆112Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆54Updated 2 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆92Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆70Updated 2 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Updated 10 months ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆42Updated 3 years ago
- ☆55Updated 8 months ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆42Updated 2 years ago
- US-based professors who work on audio. For students who would like to apply for RA, PhD, postdoc in audio research.☆26Updated 4 months ago
- Paderborn Sound Event Detection☆74Updated 2 years ago
- Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3☆39Updated 2 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- Baseline method for sound event localization task of DCASE 2022 challenge☆55Updated 3 years ago