felixgontier / dcase-2023-baseline
☆14 Updated last year
Related projects
Alternatives and complementary repositories for dcase-2023-baseline
- A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆31 Updated 3 years ago
- ☆36 Updated 2 years ago
- experiments about AudioSet ☆43 Updated last year
- Learning differentiable temporal resolution on time-series data. ☆33 Updated 2 years ago
- The PyTorch implementation of the paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training ☆39 Updated last year
- Tools for the evaluation of audio captioning. ☆14 Updated 4 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection ☆27 Updated 2 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging" ☆30 Updated 2 years ago
- ARCH: Audio Representations benCHmark ☆37 Updated 2 months ago
- ☆26 Updated last year
- Code for CVSSP submission to DCASE 2021 Task 6 ☆35 Updated 2 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE) ☆32 Updated last year
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning ☆47 Updated 10 months ago
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset ☆24 Updated 2 months ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation ☆39 Updated last year
- ☆21 Updated 4 months ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection ☆11 Updated this week
- PyTorch implementation of the paper: Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network ☆14 Updated 4 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT ☆38 Updated 2 months ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer ☆83 Updated 2 years ago
- SRTNet ☆24 Updated last year
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding ☆14 Updated 2 weeks ago
- EVAR ~ Evaluation package for Audio Representations ☆43 Updated 2 weeks ago
- PyTorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS ☆10 Updated last year
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi… ☆36 Updated 3 months ago
- Official PyTorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023) ☆39 Updated last year
- A CSRankings-like index for speech researchers ☆31 Updated last month
- A toolkit for researchers in multimodal sound separation. ☆16 Updated last year
- Streaming Audiotransformers for online Audio tagging ☆41 Updated 5 months ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners" ☆28 Updated last year