Python library for rapid prototyping of environmental sound analysis systems
★44 · May 20, 2022 · Updated 3 years ago
Alternatives and similar repositories for DCASE-models
Users who are interested in DCASE-models are comparing it to the libraries listed below.
- Permutation invariant training in PyTorch ★13 · Oct 2, 2020 · Updated 5 years ago
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning. ★47 · Feb 20, 2022 · Updated 4 years ago
- A Playground for Variational Autoencoders ★12 · Feb 11, 2018 · Updated 8 years ago
- Bag-of-Features Acoustic Event Detection ★14 · Oct 5, 2016 · Updated 9 years ago
- Sound event detection with depthwise separable and dilated convolutions. ★53 · Mar 30, 2020 · Updated 5 years ago
- Language modelling for sound event detection ★20 · Jan 2, 2020 · Updated 6 years ago
- Easy to use Audio Tagging in PyTorch ★23 · Aug 22, 2021 · Updated 4 years ago
- An audio classification system for learning with out-of-distribution data ★33 · Dec 8, 2022 · Updated 3 years ago
- Baseline method for sound event localization task of DCASE 2021 challenge ★42 · Jun 15, 2021 · Updated 4 years ago
- Implementation of an attack/decay model for piano transcription ★11 · Feb 1, 2018 · Updated 8 years ago
- The training code for the 4th place model at MDX 2021 leaderboard A. ★36 · Sep 1, 2021 · Updated 4 years ago
- Documentation of the Two!Ears Auditory Model ★13 · Feb 14, 2019 · Updated 7 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasets ★15 · Nov 9, 2021 · Updated 4 years ago
- ★60 · Feb 2, 2023 · Updated 3 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training ★41 · Dec 18, 2020 · Updated 5 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events ★134 · Apr 3, 2025 · Updated 11 months ago
- Filter Bank Implementation as Convolutional Neural Network using Python Keras ★17 · Dec 18, 2024 · Updated last year
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging ★30 · Mar 19, 2021 · Updated 5 years ago
- My system for the DCASE 2022 Task 3 Sound Event Localization and Detection. ★12 · Nov 12, 2022 · Updated 3 years ago
- A library of speech gadgets. ★14 · Oct 15, 2022 · Updated 3 years ago
- MIR conference deadline countdowns ★19 · Jun 24, 2022 · Updated 3 years ago
- MiRA (Music Replication Assessment) tool is a model-independent open evaluation method based on four diverse audio music similarity metri… ★34 · Nov 14, 2025 · Updated 4 months ago
- MobileNetV2-based baseline system for DCASE2021 Challenge Task 2. ★24 · Jun 9, 2021 · Updated 4 years ago
- On-going VA modeling research. Modeling dynamic range compressor using S4. ★19 · Nov 29, 2025 · Updated 3 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ★13 · Feb 13, 2021 · Updated 5 years ago
- Tutorial covering Open Source tools for Source Separation. ★15 · Nov 12, 2021 · Updated 4 years ago
- Reading list for research topics in Sound AI ★196 · Aug 8, 2024 · Updated last year
- ★22 · Jun 30, 2021 · Updated 4 years ago
- Python toolkit for likelihood-ratio calibration of binary classifiers ★25 · Feb 21, 2023 · Updated 3 years ago
- Unconditional music synthesis using a diffusion model in the STFT domain ★12 · May 31, 2022 · Updated 3 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning ★43 · Mar 1, 2021 · Updated 5 years ago
- Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module) ★11 · Aug 12, 2020 · Updated 5 years ago
- Adaptive pooling operators for multiple instance learning ★78 · May 12, 2022 · Updated 3 years ago
- Interspeech 2019 tutorial materials ★49 · Sep 26, 2019 · Updated 6 years ago
- Fast and differentiable hidden Markov model in C++ ★19 · Jan 20, 2023 · Updated 3 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github… ★14 · Dec 19, 2022 · Updated 3 years ago
- Digital Signals Theory book and source materials ★39 · Jan 7, 2026 · Updated 2 months ago
- A toolkit for generating datasets of midi files which have been degraded to be 'un-musical'. ★40 · Feb 27, 2025 · Updated last year
- Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat… ★33 · Jun 14, 2024 · Updated last year