tqbl/ood_audio

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tqbl/ood_audio)

tqbl / ood_audio

An audio classification system for learning with out-of-distribution data

☆33

Alternatives and similar repositories for ood_audio

Users that are interested in ood_audio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Quint-e / musicnn_keras
View on GitHub
Keras implementation of musicnn, a set of pre-trained deep convolutional neural networks for music audio tagging
☆27May 17, 2021Updated 5 years ago
RicherMans / AudioCaption
View on GitHub
Dataset and baseline for the first Audiocaption task
☆79Jul 25, 2024Updated 2 years ago
qiuqiangkong / music_transcription_MAPS
View on GitHub
☆56Jul 6, 2023Updated 3 years ago
Spijkervet / torchaudio-augmentations
View on GitHub
Audio transformations library for PyTorch
☆239Apr 19, 2022Updated 4 years ago
audio-captioning / audio-captioning-resources
View on GitHub
A list of resources that can help in research for automated audio captioning
☆34Feb 17, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tqbl / gccaps
View on GitHub
An implementation of capsule routing for sound event detection
☆15Jan 29, 2019Updated 7 years ago
Jinbo-Hu / L3DAS22-TASK2
View on GitHub
A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection
☆23Nov 14, 2024Updated last year
audio-captioning / audio-captioning-papers
View on GitHub
A list of papers about audio captioning
☆78Jul 1, 2022Updated 4 years ago
soobinseo / wavenet
View on GitHub
Audio source separation (mixture to vocal) using the Wavenet
☆21Sep 6, 2017Updated 8 years ago
fallonchen / ismir-klio
View on GitHub
Code supporting the ISMIR 2020 Klio Tutorial
☆20Oct 11, 2020Updated 5 years ago
beiciliang / sustain-pedal-detection
View on GitHub
Piano Sustain-Pedal Detection Using Convolutional Neural Networks and Transfer Learning
☆21Mar 24, 2023Updated 3 years ago
XinhaoMei / DCASE2021_task6_v2
View on GitHub
Code for CVSSP submission to DCASE 2021 Task 6
☆36Nov 22, 2022Updated 3 years ago
KinWaiCheuk / AudioLoader
View on GitHub
PyTorch Dataset for Speech and Music audio
☆79Jul 12, 2024Updated 2 years ago
liuxubo717 / LASS
View on GitHub
This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022
☆146Oct 11, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
keunwoochoi / music4all_contrib
View on GitHub
☆32Dec 29, 2020Updated 5 years ago
kyama0321 / gammachirpy
View on GitHub
A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)
☆32May 14, 2024Updated 2 years ago
urinieto / MotivesExtractor
View on GitHub
Extract Polyphonic Musical Motives from Audio Recordings
☆22Jul 20, 2019Updated 7 years ago
SAGNIKMJR / move2hear-active-AV-separation
View on GitHub
Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)
☆16Jun 17, 2026Updated last month
seungheondoh / msu-benchmark
View on GitHub
music semantic understanding evaluation benchmark
☆24Aug 12, 2023Updated 2 years ago
salesforce / speech-datasets
View on GitHub
Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…
☆15Jun 25, 2026Updated last month
emirdemirel / ASA_ICASSP2021
View on GitHub
A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…
☆15Oct 13, 2022Updated 3 years ago
ws-choi / ISMIR2020_U_Nets_SVS
View on GitHub
A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base…
☆80Jul 1, 2022Updated 4 years ago
tqbl / arca23k-dataset
View on GitHub
The code used to create the ARCA23K and ARCA23K-FSD datasets
☆16Nov 9, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
nomonosound / numpy-minmax
View on GitHub
A fast function (SIMD-accelerated) for finding the minimum and maximum value in a NumPy array
☆15Updated this week
jhtonyKoo / e2e_music_remastering_system
View on GitHub
source code of "End-to-end Music Remastering System Using Self-supervised and Adversarial Training"
☆47Sep 7, 2023Updated 2 years ago
edufonseca / uclser20
View on GitHub
Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.
☆93Dec 22, 2022Updated 3 years ago
QxLabIreland / AQP
View on GitHub
☆23Jun 13, 2022Updated 4 years ago
wsntxxn / TextToAudioGrounding
View on GitHub
The dataset and baseline code for Text-to-Audio Grounding (TAG)
☆49Oct 23, 2025Updated 9 months ago
minzwon / tag-based-music-retrieval
View on GitHub
☆58Nov 2, 2020Updated 5 years ago
MTG / Podcastmix
View on GitHub
PodcastMix A dataset for separating music and speech in podcasts.
☆44Aug 20, 2024Updated last year
yongyizang / TrainingFreeMultiStepASR
View on GitHub
Official Repository for "Training-Free Multi-Step Audio Source Separation"
☆54May 26, 2025Updated last year
ws-choi / AMSS-Net
View on GitHub
A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…
☆21Jul 4, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hendriks73 / key-cnn
View on GitHub
Framework for estimating harmonic properties of music tracks.
☆31Mar 24, 2023Updated 3 years ago
qiuqiangkong / gan_separation_deconvolution
View on GitHub
☆11Jun 2, 2019Updated 7 years ago
PanagiotisP / svs-multiband
View on GitHub
Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022
☆15Jun 18, 2022Updated 4 years ago
etzinis / heterogeneous_separation
View on GitHub
Code and data recipes for the paper: Heterogeneous Target Speech Separation
☆44Dec 6, 2022Updated 3 years ago
TEAMuP-dev / audacitorch
View on GitHub
PyTorch wrappers for using your model in audacity!
☆181Aug 13, 2023Updated 2 years ago
fgnt / paderbox
View on GitHub
Paderbox: A collection of utilities for audio / speech processing
☆43Jul 21, 2025Updated last year
rabitt / motif
View on GitHub
melodic object transcription framework
☆26Nov 15, 2017Updated 8 years ago