qiuqiangkong/audioset_tagging_cnn

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/qiuqiangkong/audioset_tagging_cnn)

qiuqiangkong / audioset_tagging_cnn

☆1,766

Alternatives and similar repositories for audioset_tagging_cnn

Users that are interested in audioset_tagging_cnn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qiuqiangkong / panns_inference
View on GitHub
☆266Mar 5, 2024Updated 2 years ago
qiuqiangkong / panns_transfer_to_gtzan
View on GitHub
☆113Jul 12, 2020Updated 6 years ago
qiuqiangkong / torchlibrosa
View on GitHub
☆512Jun 25, 2024Updated 2 years ago
yinkalario / General-Purpose-Sound-Recognition-Demo
View on GitHub
General purpose sound recognition demo
☆161Oct 3, 2023Updated 2 years ago
kkoutini / PaSST
View on GitHub
Efficient Training of Audio Transformers with Patchout
☆386Jan 12, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
qiuqiangkong / audioset_classification
View on GitHub
☆229Feb 9, 2020Updated 6 years ago
YuanGongND / ast
View on GitHub
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
☆1,465May 21, 2023Updated 3 years ago
RetroCirce / HTS-Audio-Transformer
View on GitHub
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
☆504Sep 18, 2025Updated 10 months ago
fschmid56 / EfficientAT
View on GitHub
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …
☆353Nov 20, 2024Updated last year
YuanGongND / psla
View on GitHub
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
☆150Jul 13, 2023Updated 3 years ago
iver56 / audiomentations
View on GitHub
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
☆2,301Apr 13, 2026Updated 3 months ago
turpaultn / DESED
View on GitHub
Repo associated to the DESED dataset, download and creation of data
☆155Jul 16, 2024Updated 2 years ago
karolpiczak / ESC-50
View on GitHub
ESC-50: Dataset for Environmental Sound Classification
☆1,849Mar 20, 2024Updated 2 years ago
Kikyo-16 / Sound_event_detection
View on GitHub
This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…
☆129Jul 24, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
iver56 / torch-audiomentations
View on GitHub
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,162Nov 24, 2025Updated 8 months ago
LAION-AI / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆2,233May 15, 2025Updated last year
XinhaoMei / WavCaps
View on GitHub
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
☆264Jul 25, 2024Updated 2 years ago
lRomul / argus-freesound
View on GitHub
Kaggle | 1st place solution for Freesound Audio Tagging 2019
☆313Jun 22, 2022Updated 4 years ago
yinkalario / Sound-Event-Detection-AudioSet
View on GitHub
☆48Aug 30, 2024Updated last year
audio-captioning / audio-captioning-papers
View on GitHub
A list of papers about audio captioning
☆79Jul 1, 2022Updated 4 years ago
KinWaiCheuk / nnAudio
View on GitHub
Audio processing by using pytorch 1D convolution network
☆1,129May 21, 2026Updated 2 months ago
hche11 / VGGSound
View on GitHub
VGGSound: A Large-scale Audio-Visual Dataset
☆359Sep 13, 2021Updated 4 years ago
facebookresearch / AudioMAE
View on GitHub
This repo hosts the code and models of "Masked Autoencoders that Listen".
☆673Apr 5, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
s3prl / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,556Mar 12, 2026Updated 4 months ago
YuanGongND / ssast
View on GitHub
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
☆428Aug 14, 2022Updated 3 years ago
qiuqiangkong / sound_event_detection_dcase2017_task4
View on GitHub
☆55Jun 3, 2020Updated 6 years ago
MaigoAkisame / cmu-thesis
View on GitHub
Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling
☆169May 14, 2022Updated 4 years ago
google-research / leaf-audio
View on GitHub
LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…
☆531Mar 1, 2022Updated 4 years ago
justinsalamon / scaper
View on GitHub
A library for soundscape synthesis and augmentation
☆426May 4, 2022Updated 4 years ago
asteroid-team / asteroid
View on GitHub
The PyTorch-based audio source separation toolkit for researchers
☆2,579May 13, 2026Updated 2 months ago
facebookresearch / WavAugment
View on GitHub
A library for speech data augmentation in time-domain
☆689Aug 30, 2021Updated 4 years ago
Spijkervet / CLMR
View on GitHub
Official PyTorch implementation of Contrastive Learning of Musical Representations
☆338Jul 25, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ryanwongsa / kaggle-birdsong-recognition
View on GitHub
1st Place solution to the Cornell Birdcall Identification competition.
☆155Sep 19, 2020Updated 5 years ago
sharathadavanne / seld-net
View on GitHub
Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional re…
☆405Nov 21, 2022Updated 3 years ago
DCASE-REPO / DESED_task
View on GitHub
Domestic environment sound event detection task
☆157Jun 11, 2024Updated 2 years ago
jordipons / sklearn-audio-transfer-learning
View on GitHub
A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn
☆148Nov 21, 2022Updated 3 years ago
yinkalario / Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
View on GitHub
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
☆126Jan 8, 2023Updated 3 years ago
JishengBai / AudioSetCaps
View on GitHub
A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline
☆208Dec 13, 2024Updated last year
Audio-WestlakeU / ATST-SED
View on GitHub
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
☆174Jun 8, 2026Updated last month