kkoutini / passt_hear21Links
Inference code for PaSST, using the HEAR API.
☆32Updated last year
Alternatives and similar repositories for passt_hear21
Users that are interested in passt_hear21 are comparing it to the libraries listed below
Sorting:
- EVAR ~ Evaluation package for Audio Representations☆65Updated 2 weeks ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆47Updated 2 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆130Updated last week
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Evaluation kit for the HEAR Benchmark☆61Updated last week
- ☆96Updated 4 months ago
- ☆59Updated 2 months ago
- ☆38Updated 4 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆114Updated 2 weeks ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆44Updated 3 years ago
- Pytorch implementation of subband decomposition☆92Updated 3 years ago
- A fast implementation of bss_eval metrics for blind source separation☆139Updated last month
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆70Updated 2 years ago
- Paderborn Sound Event Detection☆76Updated 2 years ago
- A list of papers about audio captioning☆79Updated 3 years ago
- Asteroid's filterbanks☆87Updated 8 months ago
- Unofficial implementation of FSD50k baselines for Sound Event Recognition☆26Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆54Updated 2 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆42Updated 2 years ago
- Translating Synthetic RIRs to Real RIRs☆43Updated 2 years ago
- ☆23Updated 11 months ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆64Updated last year
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Updated last year
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆20Updated 9 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆94Updated last year
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆65Updated 2 years ago
- ☆18Updated 3 years ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Updated 10 months ago
- Pytorch port of Google Research's LEAF Audio paper☆93Updated 4 years ago
- Repo associated to the DESED dataset, download and creation of data☆139Updated last year