Inference code for PaSST, using the HEAR API.
☆33Jan 2, 2024Updated 2 years ago
Alternatives and similar repositories for passt_hear21
Users that are interested in passt_hear21 are comparing it to the libraries listed below.
- Efficient Training of Audio Transformers with Patchout☆371Jan 12, 2024Updated 2 years ago
- Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.☆32Jun 23, 2023Updated 2 years ago
- CP-JKU submission to DCASE 20☆45Apr 19, 2021Updated 4 years ago
- ☆19Jul 15, 2022Updated 3 years ago
- Evaluation kit for the HEAR Benchmark☆63Feb 12, 2026Updated last month
- Improving Recording Device Generalization using Impulse Response Augmentation☆20Apr 24, 2025Updated 11 months ago
- Code of our ISMIR 2025 paper - D. Afchar, G. Meseguer Brocal, K. Akesbi, R. Hennequin☆35Nov 12, 2025Updated 4 months ago
- Python bindings for minimp3☆17Sep 11, 2023Updated 2 years ago
- ☆26Mar 5, 2018Updated 8 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43May 24, 2022Updated 3 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆335Nov 20, 2024Updated last year
- Repository for subjective and objective evaluation of source separation algorithms☆12Apr 18, 2018Updated 7 years ago
- A python implementation of a traditional Dynamic Range Compressor☆14Oct 30, 2020Updated 5 years ago
- ☆31Apr 22, 2024Updated last year
- ☆18Jun 12, 2025Updated 9 months ago
- MobileNetV2-based baseline system for DCASE2021 Challenge Task 2.☆24Jun 9, 2021Updated 4 years ago
- PyTorch implementation of the NSGT/sliCQT☆17Nov 10, 2023Updated 2 years ago
- ☆20Aug 26, 2022Updated 3 years ago
- Results and Models for Learning Audio Representations of Music Content☆107Dec 3, 2024Updated last year
- Simple baseline model for the HEAR benchmark☆23Feb 17, 2026Updated last month
- ☆30Sep 12, 2021Updated 4 years ago
- ☆30Jun 22, 2022Updated 3 years ago
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS…☆87Nov 13, 2022Updated 3 years ago
- ☆13Mar 7, 2022Updated 4 years ago
- Autoencoder-based baseline system for DCASE2021 Challenge Task 2.☆27Jun 9, 2021Updated 4 years ago
- ☆19Aug 16, 2025Updated 7 months ago
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"☆30Sep 18, 2023Updated 2 years ago
- Ravescript☆18Mar 9, 2025Updated last year
- A toolkit dedicated to speech evaluation.☆23Sep 26, 2024Updated last year
- Domestic environment sound event detection task☆154Jun 11, 2024Updated last year
- Python code used to analyze and process symbolic drum patterns☆14May 8, 2023Updated 2 years ago
- ☆28Oct 17, 2024Updated last year
- Codebase and utilities for using models trained by multiple music related tasks☆12Jul 6, 2023Updated 2 years ago
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]☆43Oct 7, 2024Updated last year
- Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding☆58Sep 25, 2025Updated 6 months ago
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆21Dec 8, 2022Updated 3 years ago
- Python library to write, read, and verify transparency metadata in audio files for AI transparency compliance.☆19Aug 17, 2025Updated 7 months ago