yinkalario/General-Purpose-Sound-Recognition-Demo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yinkalario/General-Purpose-Sound-Recognition-Demo)

yinkalario / General-Purpose-Sound-Recognition-Demo

General purpose sound recognition demo

☆161

Alternatives and similar repositories for General-Purpose-Sound-Recognition-Demo

Users that are interested in General-Purpose-Sound-Recognition-Demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yinkalario / Sound-Event-Detection-AudioSet
View on GitHub
☆48Aug 30, 2024Updated last year
FishMaster93 / AFFIA3K
View on GitHub
☆10Apr 12, 2023Updated 3 years ago
qiuqiangkong / audioset_tagging_cnn
View on GitHub
☆1,765Jul 25, 2024Updated last year
qiuqiangkong / panns_inference
View on GitHub
☆266Mar 5, 2024Updated 2 years ago
etzinis / heterogeneous_separation
View on GitHub
Code and data recipes for the paper: Heterogeneous Target Speech Separation
☆44Dec 6, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
microsoft / WavText5K
View on GitHub
Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"
☆50Nov 10, 2022Updated 3 years ago
Kikyo-16 / Sound_event_detection
View on GitHub
This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…
☆129Jul 24, 2020Updated 5 years ago
haoheliu / diffres-python
View on GitHub
Learning differentiable temporal resolution on time-series data.
☆36Nov 12, 2022Updated 3 years ago
qiuqiangkong / panns_transfer_to_gtzan
View on GitHub
☆113Jul 12, 2020Updated 6 years ago
frednam93 / FDY-SED
View on GitHub
☆96Jun 22, 2023Updated 3 years ago
yinkalario / Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization
View on GitHub
A two-stage polyphonic sound event detection and localization method for both SED and DOA.
☆126Jan 8, 2023Updated 3 years ago
turpaultn / DESED
View on GitHub
Repo associated to the DESED dataset, download and creation of data
☆154Jul 16, 2024Updated 2 years ago
DCASE-REPO / DESED_task
View on GitHub
Domestic environment sound event detection task
☆157Jun 11, 2024Updated 2 years ago
hqsiswiliam / persona-adaptive-attention
View on GitHub
☆26Oct 13, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
akoepke / audio-retrieval-benchmark
View on GitHub
Code for "Audio Retrieval with Natural Language Queries: A Benchmark Study", Transactions on Multimedia 2022
☆54Jul 16, 2025Updated last year
XinhaoMei / audio-text_retrieval
View on GitHub
Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'
☆51May 17, 2022Updated 4 years ago
yinkalario / DCASE2019-TASK3
View on GitHub
Our DCASE 2019 challenge task 3 method
☆32Jan 17, 2023Updated 3 years ago
TUT-ARG / sed_eval
View on GitHub
Evaluation toolbox for Sound Event Detection
☆161Jun 12, 2024Updated 2 years ago
hche11 / VGGSound
View on GitHub
VGGSound: A Large-scale Audio-Visual Dataset
☆359Sep 13, 2021Updated 4 years ago
Anaesthesiaye / sound_event_detection_transformer
View on GitHub
code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)
☆46May 9, 2022Updated 4 years ago
kyuyeonpooh / objects-that-sound
View on GitHub
The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.
☆31Jan 29, 2024Updated 2 years ago
Labbeti / aac-datasets
View on GitHub
Audio Captioning datasets for PyTorch.
☆129Mar 25, 2026Updated 3 months ago
liuxubo717 / SimPFs
View on GitHub
Code for "Simple Pooling Front-ends for Efficient Audio Calssification", ICASSP 2023
☆57Mar 3, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yinkalario / EIN-SELD
View on GitHub
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
☆79Aug 5, 2021Updated 4 years ago
mcusi / gammatonegram
View on GitHub
Python version of http://www.ee.columbia.edu/ln/rosa/matlab/gammatonegram/
☆15Oct 15, 2018Updated 7 years ago
qiuqiangkong / torchlibrosa
View on GitHub
☆512Jun 25, 2024Updated 2 years ago
soham97 / awesome-sound_event_detection
View on GitHub
Reading list for research topics in Sound AI
☆201Aug 8, 2024Updated last year
tqbl / ood_audio
View on GitHub
An audio classification system for learning with out-of-distribution data
☆33Dec 8, 2022Updated 3 years ago
swagshaw / ASC-CL
View on GitHub
Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification
☆14Jul 19, 2022Updated 4 years ago
qiuqiangkong / sed_time_freq_segmentation
View on GitHub
☆46Dec 17, 2018Updated 7 years ago
kkoutini / PaSST
View on GitHub
Efficient Training of Audio Transformers with Patchout
☆386Jan 12, 2024Updated 2 years ago
thomeou / General-network-architecture-for-sound-event-localization-and-detection
View on GitHub
This repository consists of python code to train sound event localization and detection models.
☆22Jan 21, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
qiuqiangkong / audioset_source_separation
View on GitHub
☆17Feb 14, 2020Updated 6 years ago
liuxubo717 / sound_generation
View on GitHub
Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021
☆69Sep 3, 2021Updated 4 years ago
MaigoAkisame / cmu-thesis
View on GitHub
Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling
☆169May 14, 2022Updated 4 years ago
sharathadavanne / seld-dcase2020
View on GitHub
Baseline method for sound event localization task of DCASE 2020 challenge
☆60Nov 20, 2020Updated 5 years ago
edufonseca / uclser20
View on GitHub
Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.
☆93Dec 22, 2022Updated 3 years ago
RicherMans / AudioCaption
View on GitHub
Dataset and baseline for the first Audiocaption task
☆79Jul 25, 2024Updated last year
liuxubo717 / LASS
View on GitHub
This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022
☆146Oct 11, 2023Updated 2 years ago