MTG / DCASE-models
Python library for rapid prototyping of environmental sound analysis systems
☆42Updated 2 years ago
Alternatives and similar repositories for DCASE-models:
Users that are interested in DCASE-models are comparing it to the libraries listed below
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆42Updated 2 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 2 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Updated 6 months ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆11Updated last year
- Baseline systems for the FSD50K dataset☆67Updated 3 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆39Updated 2 years ago
- Paderbox: A collection of utilities for audio / speech processing☆38Updated 7 months ago
- Unsupervised Representation Learning for Singing Voice Separation☆21Updated last year
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 4 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆62Updated last year
- PyTorch Dataset for Speech and Music audio☆73Updated 6 months ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Updated 2 years ago
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆34Updated last month
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Updated 6 years ago
- Asteroid's filterbanks☆82Updated 2 weeks ago
- Evaluation kit for the HEAR Benchmark☆56Updated 3 weeks ago
- ☆17Updated 3 years ago
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Updated 4 years ago
- Simple baseline model for the HEAR benchmark☆23Updated 3 weeks ago
- This code is to run the WARP-Q speech quality metric.☆34Updated 3 months ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆39Updated 3 years ago
- Supplementary code for the experiments described in the 2021 ISMIR submission: Leveraging Hierarchical Structures for Few Shot Musical In…☆40Updated 2 years ago
- Addressing the confounds of accompaniments in singer identification☆18Updated 4 years ago
- CNN-based singing voice detection experiments☆37Updated 6 years ago
- ☆32Updated 4 years ago
- Streaming Audiotransformers for online Audio tagging☆43Updated 7 months ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- ☆17Updated 2 years ago
- DCASE2020 Challenge Task 1 baseline system☆25Updated 4 years ago
- Translating Synthetic RIRs to Real RIRs☆41Updated last year