w-hc / torch_audioset
PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate autioset category ontology.
β66Updated 3 years ago
Related projects β
Alternatives and complementary repositories for torch_audioset
- π This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).β98Updated last year
- Audio transformations library for PyTorchβ226Updated 2 years ago
- Repo associated to the DESED dataset, download and creation of dataβ127Updated 4 months ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognitionβ70Updated 4 years ago
- Baseline systems for the FSD50K datasetβ67Updated 3 years ago
- Domestic environment sound event detection taskβ129Updated 5 months ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".β140Updated last year
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSPβ¦β59Updated 4 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wiβ¦β89Updated 3 years ago
- Reading list for research topics in Sound AIβ166Updated 3 months ago
- A library built for easier audio self-supervised training, downstream tasks evaluationβ106Updated 2 months ago
- β101Updated 4 years ago
- Baseline method for sound event localization task of DCASE 2020 challengeβ53Updated 4 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).β123Updated 2 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.β93Updated last year
- β79Updated last year
- Visualization toolbox for Sound Event Detectionβ116Updated 8 months ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.β60Updated last month
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021β64Updated 2 years ago
- β27Updated 4 months ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.β41Updated last year
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: spβ¦β125Updated 4 years ago
- Baseline of DCASE 2020 task 4β42Updated 2 years ago
- A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separationβ137Updated 3 months ago
- β53Updated 4 years ago
- Benchmark for sound event localization task of DCASE 2019 challengeβ73Updated 4 years ago
- Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.oβ¦β44Updated 3 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"β122Updated 3 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Frameworkβ76Updated 3 months ago
- Source code for models described in the paper "ESResNet: Environmental Sound Classification Based on Visual Domain Models" (https://arxivβ¦β31Updated last year