kkoutini / passt_hear21
Inference code for PaSST, using the HEAR API.
☆32Updated last year
Alternatives and similar repositories for passt_hear21:
Users who are interested in passt_hear21 are comparing it to the libraries listed below.
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- EVAR ~ Evaluation package for Audio Representations☆50Updated 5 months ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆40Updated 2 years ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆15Updated 2 years ago
- PyTorch implementation of subband decomposition☆92Updated 2 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆117Updated 7 months ago
- Prediction of sound event bounding boxes (SEBBs)☆26Updated 8 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆64Updated last year
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆46Updated 4 months ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆41Updated 2 years ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆59Updated 2 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆42Updated 2 years ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆37Updated 10 months ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Updated 2 years ago
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆14Updated 6 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆65Updated last month
- PodcastMix: A dataset for separating music and speech in podcasts.☆43Updated 7 months ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆16Updated 4 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- Experiments about AudioSet☆44Updated last year
- Simple baseline model for the HEAR benchmark☆23Updated 3 weeks ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆94Updated 8 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆37Updated 6 months ago