Inference code for PaSST, using the HEAR API.
☆33Jan 2, 2024Updated 2 years ago
Alternatives and similar repositories for passt_hear21
Users that are interested in passt_hear21 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient Training of Audio Transformers with Patchout☆374Jan 12, 2024Updated 2 years ago
- Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.☆32Jun 23, 2023Updated 2 years ago
- CP-JKU submission to DCASE 20☆45Apr 19, 2021Updated 5 years ago
- ☆19Jul 15, 2022Updated 3 years ago
- Evaluation kit for the HEAR Benchmark☆63Feb 12, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Improving Recording Device Generalization using Impulse Response Augmentation☆20Apr 24, 2025Updated 11 months ago
- Code of our ISMIR 2025 paper - D. Afchar, G. Meseguer Brocal, K. Akesbi, R. Hennequin☆36Nov 12, 2025Updated 5 months ago
- Python bindings for minimp3☆17Sep 11, 2023Updated 2 years ago
- ☆26Mar 5, 2018Updated 8 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43May 24, 2022Updated 3 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆339Nov 20, 2024Updated last year
- Repository for subjective and objective evaluation of source separation algorithms☆12Apr 18, 2018Updated 8 years ago
- A python implementation of a traditional Dynamic Range Compressor☆14Oct 30, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Information Retrieval Gender Bias Dataset☆13Apr 21, 2023Updated 2 years ago
- ☆30Apr 22, 2024Updated last year
- ☆18Jun 12, 2025Updated 10 months ago
- MobileNetV2-based baseline system for DCASE2021 Challenge Task 2.☆24Jun 9, 2021Updated 4 years ago
- PyTorch implementation of the NSGT/sliCQT☆17Nov 10, 2023Updated 2 years ago
- ☆20Aug 26, 2022Updated 3 years ago
- Results and Models for Learning Audio Representations of Music Content☆107Dec 3, 2024Updated last year
- Simple baseline model for the HEAR benchmark☆23Feb 17, 2026Updated 2 months ago
- ☆30Sep 12, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆30Jun 22, 2022Updated 3 years ago
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS…☆87Nov 13, 2022Updated 3 years ago
- ☆12Jun 26, 2024Updated last year
- ☆13Mar 7, 2022Updated 4 years ago
- Autoencoder-based baseline system for DCASE2021 Challenge Task 2.☆27Jun 9, 2021Updated 4 years ago
- ☆18Aug 16, 2025Updated 8 months ago
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"☆30Sep 18, 2023Updated 2 years ago
- Ravescript☆19Mar 9, 2025Updated last year
- A toolkit dedicate for speech evaluation.☆23Sep 26, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Domestic environment sound event detection task☆155Jun 11, 2024Updated last year
- Python code used to analyze and process symbolic drum patterns☆14May 8, 2023Updated 2 years ago
- ☆28Oct 17, 2024Updated last year
- Codebase and utilities for using models trained by multiple music related tasks☆12Jul 6, 2023Updated 2 years ago
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]☆43Oct 7, 2024Updated last year
- Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding☆58Sep 25, 2025Updated 6 months ago
- Prediction of sound event bounding boxes (SEBBs)☆34Aug 2, 2024Updated last year