unixpickle/audioset

Fetch and use Google's AudioSet dataset

☆127

Alternatives and similar repositories for audioset

Users who are interested in audioset are comparing it to the repositories listed below.

DantesLegacy / TensorFlow_AudioSet_Example
This repository contains code that was used as an example of how to use Python to download part of the AudioSet dataset and use Tensorflo…
☆13 · Aug 24, 2017 · Updated 8 years ago
jim-schwoebel / download_audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
☆106 · Aug 1, 2023 · Updated 2 years ago
audioset / ontology
The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.
☆714 · May 21, 2018 · Updated 8 years ago
marc-moreaux / audioset_raw
Download and create a tfreader for the audioset dataset
☆17 · Apr 16, 2020 · Updated 6 years ago
qiuqiangkong / audioset_classification
☆229 · Feb 9, 2020 · Updated 6 years ago
jim-schwoebel / audioset_models
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
☆31 · Jun 17, 2024 · Updated 2 years ago
MTG / Podcastmix
PodcastMix: A dataset for separating music and speech in podcasts.
☆44 · Aug 20, 2024 · Updated last year
edufonseca / uclser20
Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.
☆93 · Dec 22, 2022 · Updated 3 years ago
fgnt / paderbox
Paderbox: A collection of utilities for audio / speech processing
☆43 · Jul 21, 2025 · Updated last year
MaigoAkisame / cmu-thesis
Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling
☆169 · May 14, 2022 · Updated 4 years ago
qiuqiangkong / audioset_source_separation
☆17 · Feb 14, 2020 · Updated 6 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12 · Jun 5, 2020 · Updated 6 years ago
DemisEom / SpecAugment
An implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
☆655 · Apr 5, 2022 · Updated 4 years ago
WangHelin1997 / AT-GCN
Pytorch implementation of the paper: Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network
☆15 · Sep 18, 2020 · Updated 5 years ago
etzinis / unsup_speech_enh_adaptation
Unsupervised domain adaptation for conversational speech enhancement using RemixIT
☆59 · Apr 25, 2023 · Updated 3 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
☆13 · Mar 23, 2026 · Updated 4 months ago
ynop / audiomate
Python library for handling audio datasets.
☆139 · Jul 6, 2023 · Updated 3 years ago
xavierfav / coala
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations
☆48 · Jul 25, 2024 · Updated 2 years ago
hche11 / VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
☆359 · Sep 13, 2021 · Updated 4 years ago
nextco / audio-classification
Audio Classification - Multilayer Neural Networks using TensorFlow
☆28 · Mar 9, 2017 · Updated 9 years ago
ganesh-srinivas / audioset-tutorial
Tutorials and examples for Google AudioSet
☆17 · Sep 19, 2017 · Updated 8 years ago
yuhogun0908 / MISOnet
Unofficial Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation (MISO-BF-MISO)
☆52 · Jan 13, 2022 · Updated 4 years ago
unixpickle / torch-bandpass
An implementation of the Prism layer (https://arxiv.org/abs/2011.04823)
☆12 · Nov 13, 2020 · Updated 5 years ago
qiuqiangkong / ICASSP2018_audioset
☆27 · Apr 12, 2018 · Updated 8 years ago
qiuqiangkong / sed_time_freq_segmentation
☆46 · Dec 17, 2018 · Updated 7 years ago
aoifemcdonagh / audioset-processing
Toolkit for downloading and processing Google's AudioSet dataset.
☆180 · Aug 22, 2025 · Updated 11 months ago
amanteur / CHAD
Official Code of "A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task" (ISMIR 2023)
☆19 · Nov 7, 2023 · Updated 2 years ago
TUT-ARG / DCASE2017-baseline-system
DCASE 2017 Baseline system
☆81 · Jun 26, 2020 · Updated 6 years ago
TUIlmenauAMS / FilterBanks_FastPythonImplementation
Filter Banks, Fast Python Implementation
☆42 · Jul 9, 2022 · Updated 4 years ago
qiuqiangkong / audioset_tagging_cnn
☆1,766 · Jul 25, 2024 · Updated 2 years ago
mdx-tutorial / mdx-tutorial.github.io
Tutorial covering Open Source tools for Source Separation.
☆15 · Nov 12, 2021 · Updated 4 years ago
google / df-conformer
Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.
☆36 · Jun 23, 2026 · Updated last month
iver56 / torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,162 · Nov 24, 2025 · Updated 8 months ago
wangyu / rethink-audio-fsl
Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)
☆43 · May 24, 2022 · Updated 4 years ago
tqbl / ood_audio
An audio classification system for learning with out-of-distribution data
☆33 · Dec 8, 2022 · Updated 3 years ago
JaesungHuh / VoxSRC2022
VoxSRC2022 workshop development kit
☆19 · Jul 21, 2022 · Updated 4 years ago
nttcslab / eval-audio-repr
EVAR ~ Evaluation package for Audio Representations
☆81 · Feb 19, 2026 · Updated 5 months ago
AppleHolic / audioset_augmentor
Sound augmentation using Large-scale audio dataset (Audioset)
☆45 · Jun 29, 2021 · Updated 5 years ago
fakufaku / 2020_interspeech_gmdp
Generalized Minimal Distortion Principle for Blind Source Separation
☆22 · Sep 16, 2020 · Updated 5 years ago