qiuqiangkong/panns_inference

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/qiuqiangkong/panns_inference)

qiuqiangkong / panns_inference

☆266

Alternatives and similar repositories for panns_inference

Users that are interested in panns_inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qiuqiangkong / audioset_tagging_cnn
View on GitHub
☆1,765Jul 25, 2024Updated last year
qiuqiangkong / panns_transfer_to_gtzan
View on GitHub
☆113Jul 12, 2020Updated 6 years ago
yinkalario / General-Purpose-Sound-Recognition-Demo
View on GitHub
General purpose sound recognition demo
☆161Oct 3, 2023Updated 2 years ago
Arshdeep-Singh-Boparai / E-PANNs
View on GitHub
☆19Aug 19, 2025Updated 11 months ago
qiuqiangkong / torchlibrosa
View on GitHub
☆512Jun 25, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
YuanGongND / ast
View on GitHub
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
☆1,464May 21, 2023Updated 3 years ago
YuanGongND / psla
View on GitHub
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
☆150Jul 13, 2023Updated 3 years ago
audio-captioning / clotho-dataset
View on GitHub
Python code for handling the Clotho dataset.
☆85Nov 24, 2020Updated 5 years ago
yinkalario / EIN-SELD
View on GitHub
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
☆79Aug 5, 2021Updated 4 years ago
kkoutini / PaSST
View on GitHub
Efficient Training of Audio Transformers with Patchout
☆386Jan 12, 2024Updated 2 years ago
RetroCirce / HTS-Audio-Transformer
View on GitHub
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
☆502Sep 18, 2025Updated 10 months ago
turpaultn / DESED
View on GitHub
Repo associated to the DESED dataset, download and creation of data
☆154Jul 16, 2024Updated 2 years ago
lukewys / dcase_2020_T6
View on GitHub
2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…
☆24Aug 3, 2023Updated 2 years ago
JishengBai / AudioSetCaps
View on GitHub
A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline
☆208Dec 13, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
audio-captioning / dcase-2020-baseline
View on GitHub
Audio captioning baseline system for DCASE 2020 challenge.
☆38Aug 22, 2023Updated 2 years ago
qiuqiangkong / sampleRNN_acoustic_scene_generation
View on GitHub
☆14Apr 18, 2019Updated 7 years ago
yinkalario / DCASE2019-TASK3
View on GitHub
Our DCASE 2019 challenge task 3 method
☆32Jan 17, 2023Updated 3 years ago
yinkalario / Sound-Event-Detection-AudioSet
View on GitHub
☆48Aug 30, 2024Updated last year
RicherMans / CDur
View on GitHub
Repository for the paper "Towards duration robust weakly supervised sound event detection"
☆23Aug 3, 2023Updated 2 years ago
iver56 / audiomentations
View on GitHub
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
☆2,302Apr 13, 2026Updated 3 months ago
justinsalamon / scaper
View on GitHub
A library for soundscape synthesis and augmentation
☆426May 4, 2022Updated 4 years ago
LAION-AI / emotional-speech-annotations
View on GitHub
This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models
☆35Oct 13, 2024Updated last year
sharathadavanne / seld-dcase2021
View on GitHub
Baseline method for sound event localization task of DCASE 2021 challenge
☆45Jun 15, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
iver56 / torch-audiomentations
View on GitHub
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,160Nov 24, 2025Updated 7 months ago
Spijkervet / torchaudio-augmentations
View on GitHub
Audio transformations library for PyTorch
☆239Apr 19, 2022Updated 4 years ago
fschmid56 / EfficientAT
View on GitHub
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …
☆353Nov 20, 2024Updated last year
modelscope / FunCodec
View on GitHub
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music gener…
☆445Jan 25, 2024Updated 2 years ago
XinhaoMei / WavCaps
View on GitHub
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
☆264Jul 25, 2024Updated last year
k2-fsa / Flow2GAN
View on GitHub
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
☆144Mar 8, 2026Updated 4 months ago
kyungyunlee / mono2mixed-singer
View on GitHub
[ismir2019] Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice
☆28Dec 8, 2022Updated 3 years ago
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
morgan76 / LinkSeg
View on GitHub
PyTorch implementation of the paper Using Pairwise Link Prediction and Graph Attention Networks for Music Structure Analysis presented at…
☆23Apr 2, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
marl / openl3
View on GitHub
OpenL3: Open-source deep audio and image embeddings
☆598Jun 17, 2023Updated 3 years ago
yinkalario / Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
View on GitHub
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
☆126Jan 8, 2023Updated 3 years ago
LAION-AI / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆2,226May 15, 2025Updated last year
YuanGongND / ssast
View on GitHub
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
☆426Aug 14, 2022Updated 3 years ago
c4dm / dcase-few-shot-bioacoustic
View on GitHub
☆61Jul 2, 2024Updated 2 years ago
huggingface / dataspeech
View on GitHub
☆399Sep 3, 2024Updated last year
yangdongchao / AcademiCodec
View on GitHub
AcademiCodec: An Open Source Audio Codec Model for Academic Research
☆674Dec 27, 2023Updated 2 years ago